SuperFect (Qiagen) and pulse-labeled for 3 hours with [35S]methionine and [35S]cysteine. Both epitope-tagged
proteins co-migrate when 20 microliters of 15-fold concentrated serum-free conditioned medium were
electrophoresed on a polyacrylamide gel (Novex) in sodium dodecyl sulfate sample buffer (SDS-PAGE). The
VEGF-E-IgG expression plasmid was constructed by cloning the ORF in front of the human Fc (IgG) sequence.
The VEGF-E-IgG plasmid was co-transfected with Baculogold Baculovirus DNA (Pharmingen) using
Lipofectin (GibcoBRL) into 10^5 Sf9 cells grown in Hink's TNM-FH medium (JRH Biosciences) supplemented
with 10% fetal bovine serum. Cells were incubated for 5 days at 28°C. The supernatant was harvested and
subsequently used for the first viral amplification by infecting Sf9 cells at an approximate multiplicity of infection
(MOI) of 10. Cells were incubated for 3 days, then supernatant harvested, and expression of the recombinant
plasmid determined by binding of 1 ml of supernatant to 30 µl of Protein-A Sepharose CL-4B beads (Pharmacia)
followed by subsequent SDS-PAGE analysis. The first amplification supernatant was used to infect a 500 ml
spinner culture of Sf9 cells grown in ESF-921 medium (Expression Systems LLC) at an approximate MOI of
0.1. Cells were treated as above, except harvested supernatant was sterile filtered. Specific protein was purified
by binding to a Protein-A Sepharose 4 Fast Flow (Pharmacia) column.

EXAMPLE 86: Northern Blot Analyses for PRO200

Blots of human poly(A)+ RNA from multiple adult and fetal tissues and tumor cell lines were obtained
from Clontech (Palo Alto, CA). Hybridization was carried out using 32P-labeled probes containing the entire
coding region and washed in 0.1 x SSC, 0.1% SDS at 63°C.

VEGF-E mRNA was detectable in fetal lung, kidney, brain, liver and adult heart, placenta, liver,
skeletal muscle, kidney, and pancreas. VEGF-E mRNA was also found in A549 lung adenocarcinoma and HeLa
cervical adenocarcinoma cell lines.

EXAMPLE 87: In Situ Hybridization of Human Fetal Tissue Sections for PRO200

Formalin-fixed, paraffin-embedded human fetal brain, liver, lower limb, small intestine, thyroid, lymph
node, thymus, stomach, trachea, skin, spleen, spinal cord, adrenal, placenta, cord, and adult liver, pancreas,
lung, spleen, lymph node, adrenal, heart, aorta, and skin were sectioned, deparaffinized, deproteinated in
proteinase K (20 µg/ml) for 15 minutes at 37°C, and further processed for in situ hybridization as described by
Lu LH and Gillett NA (Cell Vision 1: 169-176, 1994). A [α-33P]UTP-labeled antisense riboprobe was generated
from a PCR product of 980 bp (primers GGCGGAATCCAACCTGAGTAG and
GCGGCTATCCTCCTGTGCTC, SEQ ID NOS: 493 and 494, respectively). The slides were dipped in Kodak
NTB2 nuclear track emulsion and exposed for 4 weeks.

VEGF-E mRNA expression included localization at the growth plate region and embracing fetal
myocytes.

EXAMPLE 88: Myocyte Hypertrophy Assay for PRO200

Myocytes from neonatal Harlan Sprague Dawley rat heart ventricle (23 days gestation) were plated in
duplicate at 75000 cells/ml in a 96-well plate. Cells were treated for 48h with 2000, 200, 20, or 2 ng/ml
VEGF-E-IgG. Myocytes were stained with crystal violet to visualize morphology and scored on a scale of 3 to
7, 3 being nonstimulated and 7 being full-blown hypertrophy.

2000 ng/ml and 200 ng/ml VEGF-E caused hypertrophy, scored as a 5.



EXAMPLE 89: Cell Proliferation Assay for PRO200

Mouse embryonic fibroblast C3H10T1/2 cells (ATCC) were grown in 50:50 Ham's F-12: low glucose
DMEM medium containing 10% fetal calf serum (FCS). Cells were plated in duplicate in a 24-well plate at
1000, 2000, and 4000 cells/well. After 48 hours, cells were switched to medium containing 2% FCS and were
incubated for 72 hours with 200, 800, or 2000 ng/ml VEGF-E or no growth factor added.

Approximately 1.5-fold more cells were measured in the presence of 200 ng/ml VEGF-E
than in its absence, at all three cell densities.


EXAMPLE 90: Endothelial Cell Survival Assay for PRO200 

Human umbilical vein endothelial cells (HUVEC, Cell Systems) were maintained in Complete Media
(Cell Systems) and plated in triplicate in serum-free medium (Basic Media from Cell Systems containing 0.1%
BSA) at 20,000 cells/well of a 48-well plate. Cells were incubated for 5 days with 200 or 400 ng/ml
VEGF-E-IgG, 100 ng/ml VEGF, 20 ng/ml basic FGF, or no addition.

Survival was 2-3 times greater with VEGF-E as compared to lack of growth factor addition. VEGF
and basic FGF were included as positive controls.


EXAMPLE 91: Isolation of cDNA Clones Encoding Human PRO285

A proprietary expressed sequence tag (EST) DNA database (LIFESEQ™, Incyte Pharmaceuticals, Palo
Alto, CA) was searched and an EST (^9^43209) was identified which showed homology to the Drosophila Toll
protein.

Based on the EST, a pair of PCR primers (forward and reverse):
TAAAGACCCAGCTGTGACCG (SEQ ID NO:499)
ATCCATGAGCCTCTGATGGG (SEQ ID NO:500), and
a probe:
ATTTATGTCTCGAGGAAAGGGACTGGTTACCAGGGCAGCCAGTTC (SEQ ID NO:501)
were synthesized.

mRNA for construction of the cDNA libraries was isolated from human placenta tissue. The cDNA
libraries used to isolate the cDNA clones were constructed by standard methods using commercially available
reagents such as those from Invitrogen, San Diego, CA (Fast Track 2). The cDNA was primed with oligo dT
containing a NotI site, linked with blunt to SalI hemikinased adaptors, cleaved with NotI, sized appropriately
by gel electrophoresis, and cloned in a defined orientation into the cloning vector pCR2.1 (Invitrogen, Inc.)
using reagents and protocols from Life Technologies, Gaithersburg, MD (Super Script Plasmid System). The
double stranded cDNA was sized to greater than 1000 bp and the cDNA was cloned into BamHI/NotI cleaved
vector. pCR2.1 is a commercially available plasmid, designed for easy cloning of PCR fragments, that carries
AmpR and KanR genes for selection, and LacZ gene for blue-white selection.

In order to screen several libraries for a source of a full-length clone, DNA from the libraries was
screened by PCR amplification with the PCR primer pair identified above. A positive library was then used to
isolate clones encoding the PRO285 gene using the probe oligonucleotide and one of the PCR primers.

A cDNA clone was sequenced in entirety. The entire nucleotide sequence of DNA40021-1154
(encoding PRO285) is shown in Figure 208 (SEQ ID NO:495). Clone DNA40021-1154 contains a single open
reading frame with an apparent translational initiation site at nucleotide positions 61-63 (Figure 208). The
predicted polypeptide precursor is 1049 amino acids long, including a putative signal peptide at amino acid
positions 1-29, a putative transmembrane domain between amino acid positions 837-860, and leucine zipper
patterns at amino acid positions 132-153 and 704-725, respectively. It is noted that the indicated boundaries are
approximate, and the actual limits of the indicated regions might differ by a few amino acids. Clone
DNA40021-1154 has been deposited with ATCC (designation: DNA40021-1154) and is assigned ATCC deposit no. 209389.

Based on a BLAST and FastA sequence alignment analysis (using the ALIGN computer program) of
the full-length sequence, PRO285 is a human analogue of the Drosophila Toll protein, and is homologous to the
following human Toll proteins: Toll1 (DNAX# HSU88540-1, which is identical with the random sequenced full-length
cDNA #HUMRSC786-1); Toll2 (DNAX# HSU88878-1); Toll3 (DNAX# HSU88879-1); and Toll4 (DNAX#



EXAMPLE 92: Isolation of cDNA Clones Encoding Human PRO286

A proprietary expressed sequence tag (EST) DNA database (LIFESEQ™, Incyte Pharmaceuticals, Palo
Alto, CA) was searched and an EST (#694401) was identified which showed homology to the Drosophila Toll
protein.



Based on the EST, a pair of PCR primers (forward and reverse):

GCCX3AGACAAAAACGTTCTCC (SEQ ID NO:502)

CATCCATGTTCTCATCCATTAGCC (SEQ ID NO:503), and
a probe: 

TCGACAACCrCATGCAGAGCATCAACCAAAGCAAGAAAACAGTATT (SEQ ID NO: 504) 



mRNA for construction of the cDNA libraries was isolated from human placenta tissue. This RNA was
used to generate an oligo dT primed cDNA library in the vector pRK5D using reagents and protocols from Life
Technologies, Gaithersburg, MD (Super Script Plasmid System). pRK5D is a cloning vector that has an sp6
transcription initiation site followed by an SfiI restriction enzyme site preceding the XhoI/NotI cDNA cloning
sites. The cDNA was primed with oligo dT containing a NotI site, linked with blunt to SalI hemikinased
adaptors, cleaved with NotI, sized to greater than 1000 bp by gel electrophoresis, and cloned in
a defined orientation into XhoI/NotI-cleaved pRK5D.

In order to screen several libraries for a source of a full-length clone, DNA from the libraries was
screened by PCR amplification with the PCR primer pair identified above. A positive library was then used to
isolate clones encoding the PRO286 gene using the probe oligonucleotide identified above and one of the PCR
primers.

A cDNA clone was sequenced in entirety. The entire nucleotide sequence of DNA42663-1154
(encoding PRO286) is shown in Figure 210 (SEQ ID NO:497). Clone DNA42663-1154 contains a single open
reading frame with an apparent translational initiation site at nucleotide positions 57-59 (Figure 211). The
predicted polypeptide precursor is 1041 amino acids long, including a putative signal peptide at amino acid
positions 1-26, a potential transmembrane domain at amino acid positions 826-848, and leucine zipper patterns
at amino acids 130-151, 206-227, 662^, 669-690 and 693-614, respectively. It is noted that the indicated
boundaries are approximate, and the actual limits of the indicated regions might differ by a few amino acids.
Clone DNA42663-1154 has been deposited with ATCC (designation: DNA42663-1154) and is assigned ATCC
deposit no. 209386.

Based on a BLAST and FastA sequence alignment analysis (using the ALIGN computer program) of
the full-length sequence of PRO286, it is a human analogue of the Drosophila Toll protein, and is homologous
to the following human Toll proteins: Toll1 (DNAX# HSU88540-1, which is identical with the random
sequenced full-length cDNA #HUMRSC786-1); Toll2 (DNAX# HSU88878-1); Toll3 (DNAX# HSU88879-1);
and Toll4 (DNAX# HSU88880-1).



EXAMPLE 93: NF-κB Assay for PRO285 and PRO286

As the Toll proteins signal through the NF-κB pathway, their biological activity can be tested in an
NF-κB assay. In this assay Jurkat cells are transiently transfected using Lipofectamine reagent (Gibco BRL)
according to the manufacturer's instructions. 1 µg pB2XLuc plasmid, con^
is cotransfected with 1 µg pSRαN expression vector with or without the inse^
For a positive control, cells are treated with PMA (phorbol myristyl acetate; 20 ng/ml) and PHA
(phytohaemagglutinin, 2 µg/ml) for three to four hours. Cells are lysed 2 or 3 days later for measurement of
luciferase activity using reagents from Promega.

EXAMPLE 94: Isolation of cDNA Clones Encoding Human PRO213-1, PRO1330 and PRO1449

A consensus DNA sequence was assembled relative to other EST sequences using phrap as described
in Example 1 above. This consensus sequence is herein designated DNA28735. Based on the DNA28735
consensus sequence, oligonucleotides were synthesized: 1) to identify by PCR a cDNA library that contained
the sequence of interest, and 2) for use as probes to isolate a clone of the full-length coding sequence for
PRO213-1, PRO1330 and/or PRO1449. A pair of PCR primers (forward and reverse) were synthesized:
forward PCR primer 5'-TGGAGCAGCAATATGCCAGCC-3' (SEQ ID NO:511)
reverse PCR primer 5'-TTTTCCACTCCTGTCGGGTTGG-3' (SEQ ID NO:512)

Additionally, a synthetic oligonucleotide hybridization probe was constructed from the consensus DNA28735
sequence which had the following nucleotide sequence:
hybridization probe

5'-GGTGACACTTGCCAGTCAGATGTGGATGAATGCAGTGCTAGGAGGG-3' (SEQ ID NO:513)

In order to screen several libraries for a source of a full-length clone, DNA from the libraries was
screened by PCR amplification with the PCR primer pair identified above. A positive library was then used to
isolate clones encoding the PRO213-1, PRO1330 and/or PRO1449 gene using the probe oligonucleotide and one
of the PCR primers. RNA for construction of the cDNA libraries was isolated from human fetal lung tissue.
DNA sequencing of the clones isolated as described above gave the full-length DNA sequence encoding
PRO213-1, PRO1330 and/or PRO1449 [DNA30943-1-1163-1 (SEQ ID NO:505), DNA64907-1163-1 (SEQ ID
NO:507) and DNA64908-1163-1 (SEQ ID NO:509), respectively].

The entire nucleotide sequences corresponding to DNA30943-1-1163-1 (SEQ ID NO:505), DNA64907-1163-1
(SEQ ID NO:507) and DNA64908-1163-1 (SEQ ID NO:509), respectively. DNA30943-1163,
DNA64907-1163-1 and DNA64908-1163-1 contain a single open reading frame with an apparent translational
initiation site at nucleotide positions 336-338, 488-490 and 326-328, respectively, and ending at the stop codon
at nucleotide positions 1221-1223, 1307-1309 and 1145-1147, respectively (Figures 212, 214 and 216). The
predicted polypeptide precursor is 295, 273 and 273 amino acids long, respectively (Figures 213, 215 and 217).
DNA30943-1-1163-1, DNA64907-1163-1 and DNA64908-1163-1 have been deposited with ATCC and are
assigned ATCC deposit no. 209791, 203242 and 203243, respectively.
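
The stated start codon, stop codon, and polypeptide length for each clone can be cross-checked with simple reading-frame arithmetic. The following is a minimal sketch, not part of the original disclosure; the function names are hypothetical, and the coordinates are the ones quoted in the paragraph above.

```python
def orf_length_aa(start_first_nt, stop_first_nt):
    """Count codons from the initiation codon up to, but not including, the stop codon.

    Both arguments are 1-based positions of the first nucleotide of the codon.
    """
    span = stop_first_nt - start_first_nt
    if span <= 0 or span % 3 != 0:
        raise ValueError("start and stop codons are not in the same reading frame")
    return span // 3


# (first nt of start codon, first nt of stop codon, stated length in amino acids)
CLONES = {
    "DNA30943-1-1163-1": (336, 1221, 295),
    "DNA64907-1163-1": (488, 1307, 273),
    "DNA64908-1163-1": (326, 1145, 273),
}


def check_clones(clones=CLONES):
    """Return, per clone, whether the coordinates agree with the stated length."""
    return {
        name: orf_length_aa(start, stop) == stated
        for name, (start, stop, stated) in clones.items()
    }


if __name__ == "__main__":
    for name, consistent in check_clones().items():
        print(name, "consistent" if consistent else "inconsistent")
```

All three sets of coordinates are internally consistent under this check.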
Analysis of the amino acid sequence of the full-length PRO213-1 polypeptide suggests that a portion
of it possesses significant homology to the human growth arrest-specific gene 6 protein. More specifically, an
analysis of the Dayhoff database (version 35.45 SwissProt 35) evidenced significant homology between the
PRO213 amino acid sequence and the following Dayhoff sequences, HSMHC3W5^

Additional analysis of the amino acid sequence of the full-length PRO1330 and PRO1449 polypeptide
indicates significant identity with notch4. More specifically, an analysis of the Dayhoff database (version 35. 130
SwissProt 35) evidenced significant identity between PRO1330 and the following Dayhoff sequences, D86566_1
and NEL_HUMAN.

EXAMPLE 95: Isolation of cDNA Clones Encoding Human PRO298

A cDNA isolated in the amylase screen described in Example 2 above is herein designated DNA26832
(Figure 220; SEQ ID NO:516). The sequence of DNA26832 was then used to search expressed sequence tag
(EST) databases. The EST databases included public EST databases (e.g., GenBank) and a proprietary EST
database (LIFESEQ™, Incyte Pharmaceuticals, Palo Alto, CA). The search was performed using the computer
program BLAST or BLAST2 (Altschul et al., Methods in Enzymology 266: 460-480 [1996]). Those comparisons
resulting in a BLAST score of 70 (or in some cases 90) or greater that did not encode proteins were clustered
and assembled into consensus DNA sequences with the program "phrap" (Phil Green, University of Washington,
Seattle, Washington; http://bozeman.mbt.washington.edu/phrap.docs/phrap.html).

A consensus DNA sequence was assembled relative to other EST sequences using phrap. A consensus
sequence was determined, which was then extended using repeated cycles of BLAST and phrap
as far as possible using the sources of EST sequences discussed above. The extended
assembly sequence was designated DNA35861. Based on the DNA35861 consensus sequence, oligonucleotides
were synthesized: 1) to identify by PCR a cDNA library that contained the sequence of interest, and 2) for use
as probes to isolate a clone of the full-length coding sequence of PRO298. Forward and reverse primers
generally range from 20 to 30 nucleotides and are often designed to give a PCR product of about 100-1000 bp
in length. The probe sequence is typically 40-55 bp in length. In some cases, additional oligonucleotides are
synthesized when the consensus sequence is greater than about 1-1.5 kbp. In order to screen several libraries
for a full-length clone, DNA from the libraries was screened by PCR amplification, as per Ausubel et al.,
Current Protocols in Molecular Biology, with the PCR primer pair. A positive library was used to isolate clones
encoding the gene of interest using the probe oligonucleotide and one of the primer pairs.
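
The length guidelines stated above (primers of 20 to 30 nucleotides, probes of 40-55 bp) lend themselves to a quick automated check. The sketch below is illustrative only and not part of the original disclosure: the helper names are hypothetical, and the oligonucleotides used as input are the primer pair and probe listed for Example 94 (SEQ ID NOS: 511-513).

```python
# Length windows taken from the text: primers 20-30 nt, probes 40-55 bp.
PRIMER_NT = (20, 30)
PROBE_NT = (40, 55)


def length_ok(seq, bounds):
    """Return True when the oligonucleotide length falls inside the inclusive window."""
    low, high = bounds
    return low <= len(seq.strip()) <= high


# Oligonucleotides quoted in Example 94, paired with the window each should satisfy.
EXAMPLE_94_OLIGOS = {
    "forward primer (SEQ ID NO:511)": ("TGGAGCAGCAATATGCCAGCC", PRIMER_NT),
    "reverse primer (SEQ ID NO:512)": ("TTTTCCACTCCTGTCGGGTTGG", PRIMER_NT),
    "hybridization probe (SEQ ID NO:513)": (
        "GGTGACACTTGCCAGTCAGATGTGGATGAATGCAGTGCTAGGAGGG",
        PROBE_NT,
    ),
}


def check_oligos(oligos=EXAMPLE_94_OLIGOS):
    """Map each oligonucleotide name to whether it meets its length guideline."""
    return {name: length_ok(seq, bounds) for name, (seq, bounds) in oligos.items()}


if __name__ == "__main__":
    for name, ok in check_oligos().items():
        print(name, "within guideline" if ok else "outside guideline")
```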

PCR primers (forward and reverse) and a hybridization probe were synthesized:
forward PCR primer 1 GAAGGTGATTrCAAAGGTGGGGTG (SEQ ID NO:517)
forward PCR primer 2 GGGTCGTATGAAGAATTTGG (SEQ ID NO:518)
forward PCR primer 3 AGTGGAAGTGGAGCTGGG (SEQ ID NO:519)
reverse PCR primer 1 CTnACjcrrnA A ATrrrrr AT &nrrr (SEQ ID NO:520)

hybridization probe 1 GGGAAAAGGCATTTTGGGAGGAGGAATTGGAATGATGTGTCimTCK^^
(SEQ ID NO:521)

In order to screen several libraries for a source of a full-length clone, DNA from the libraries was
screened by PCR amplification with the PCR primer pair identified above. A positive library was then used to
isolate clones encoding the PRO298 gene using the probe oligonucleotide and one of the PCR primers.

RNA for construction of the cDNA libraries was isolated from human fetal lung tissue (LIB25). The
cDNA libraries used to isolate the cDNA clones were constructed by standard methods using commercially
available reagents such as those from Invitrogen, San Diego, CA. The cDNA was primed with oligo dT
containing a NotI site, linked with blunt to SalI hemikinased adaptors, cleaved with NotI, sized appropriately
by gel electrophoresis, and cloned in a defined orientation into a suitable cloning vector (such as pRK5B or
pRK5D; pRK5B is a precursor of pRK5D that does not contain the SfiI site; see Holmes et al., Science,
253:1278-1280 (1991)) in the unique XhoI and NotI sites.

DNA sequencing of the clones isolated as described above gave the full-length DNA sequence for
PRO298 (herein designated UNQ261 [DNA39975-1210]) (SEQ ID NO:514), and the derived protein sequence
for PRO298 (SEQ ID NO:515).

The entire nucleotide sequence of UNQ261 (DNA39975-1210) is shown in Figure 218 (SEQ ID
NO:514). Clone DNA39975-1210 contains a single open reading frame with an apparent translational initiation
site at nucleotide positions 375-377. The predicted polypeptide precursor is 364 amino acids long. The protein
contains four putative transmembrane domains between amino acid positions 36-55 (type II TM), 65-84,
188-208, and 229-245, respectively. A putative N-linked glycosylation site starts at amino acid position 253. In
addition, the following features have been identified in the protein sequence: cAMP- and cGMP-dependent
protein kinase phosphorylation site, starting at position 8; N-myristoylation sites starting at positions 173 and 262,
respectively; and a ZP domain between amino acid positions 45-60. Clone DNA39975-1210 has been deposited
with ATCC (April 21, 1998) and is assigned ATCC deposit no. 209783.

EXAMPLE 96: Isolation of cDNA Clones Encoding Human PRO337

A cDNA sequence identified in the amylase screen described in Example 2 above is herein designated
DNA42301 (Figure 223, SEQ ID NO:524). The DNA42301 sequence was then compared to other EST
sequences using phrap as described in Example 1 above and a consensus sequence designated herein as
DNA28761 was identified. Based on this consensus sequence, oligonucleotides were synthesized: 1) to identify
by PCR a cDNA library that contained the sequence of interest, and 2) for use as probes to isolate a clone of
the full-length coding sequence. In order to screen several libraries for a source of a full-length clone, DNA
from the libraries was screened by PCR amplification with the PCR primer pair identified above. A positive
library was then used to isolate clones encoding the PRO337 gene using the probe oligonucleotide and one of
the PCR primers. RNA for construction of the cDNA libraries was isolated from human fetal brain.

A cDNA clone was sequenced in its entirety. The full length nucleotide sequence of DNA43316-1237
is shown in Figure 221 (SEQ ID NO:5Zl). Clone DNA43316-1237 contains a single open reading frame with
an apparent translational initiation site at nucleotide positions 134-136 (Figure 221; SEQ ID NO:522). The
predicted polypeptide precursor is 344 amino acids long. Clone DNA43316-1237 has been deposited with
ATCC and is assigned ATCC deposit no. 209487.

Based on a BLAST-2 and FastA sequence alignment analysis of the full-length sequence, PRO337 shows
amino acid sequence identity to rat neurotrimin (97%).

EXAMPLE 97: Isolation of cDNA Clones Encoding Human PRO403
Introduction:

Human thrombopoietin (THPO) is a glycosylated hormone of 352 amino acids consisting of two
domains. The N-terminal domain, sharing 50% similarity to erythropoietin, is responsible for the biological
activity. The C-terminal region is required for secretion. The gene for thrombopoietin (THPO) maps to human
chromosome 3q27-q28 where the six exons of this gene span 7 kilobase pairs of genomic DNA (Chang et
al., Genomics 26: 636-7 (1995); Foster et al., Proc. Natl. Acad. Sci. USA 91: 13023-7 (1994); Gurney et al.,
Blood 85: 981-988 (1995)). In order to determine whether there were any genes encoding THPO homologues
located in close proximity to THPO, genomic DNA fragments from this region were identified and sequenced.
Three P1 clones and one PAC clone (Genome Systems Inc., St. Louis, MO; cat. Nos. P1-2535 and PAC-6539)
encompassing the THPO locus were isolated and a 140 kb region was sequenced using the ordered shotgun
strategy (Chen et al., Genomics 17: 651-656 (1993)), coupled with a PCR-based gap filling approach. Analysis
reveals that the region is gene-rich with four additional genes located very close to THPO: tumor necrosis
factor-receptor type 1 associated protein 2 (TRAP2) and elongation initiation factor gamma (eIF4g), chloride channel
2 (CLCN2) and RNA polymerase II subunit hRPB17. While no THPO homolog was found in the region, four
novel genes have been predicted by computer-assisted gene detection (GRAIL) (Xu et al., Gen. Engin. 16:
241-253 (1994)), the presence of CpG islands (Cross, S. and Bird, A., Curr. Opin. Genet. & Devel. 5: 109-314
(1995)), and homology to known genes (as detected by WU-BLAST2.0) (Altschul and Gish, Methods Enzymol.
266: 460-480 (1996); http://blast.wustl.edu/blast/README.html).
Procedures:
P1 and PAC clones:

The initial human P1 clone was isolated from a genomic P1 library (Genome Systems Inc., St. Louis,
MO; cat. no.: P1-2535) screened with PCR primers designed from the THPO genomic sequence (A.L. Gurney,
et al., Blood 85: 981-88 (1995)). PCR primers were designed from the end sequences derived from this P1 clone
and were then used to screen P1 and PAC libraries (Genome Systems, Cat. Nos.: P1-2535 & PAC-6539) to identify
overlapping clones (PAC.z, P1.t, and P1.u). The 3'-end sequence from PAC.z was used to define the primers
used for the screening of a human BAC library (Genome Systems Inc., St. Louis, MO; Cat. No.: BDTW-4533A).

Ordered Shotgun Strategy:

The Ordered Shotgun Strategy (OSS) (Chen et al., Genomics 17: 651-656 (1993)) involves the mapping
and sequencing of large genomic DNA clones with a hierarchical approach. The P1 or PAC clone was sonicated
and the fragments subcloned into lambda vector (λBluestar) (Novagen, Inc., Madison, WI; cat. no. 69242-3).
The lambda subclone inserts were isolated by long-range PCR (Barnes, W. Proc. Natl. Acad. Sci. USA 91:
2216-2220 (1994)) and the ends sequenced. The lambda-end sequences were overlapped to create a partial map
of the original clone. Those lambda clones with overlapping end-sequences were identified, the inserts subcloned
into a plasmid vector (pUC18 or pUC19, Hoefer Pharmacia Biotech, Inc., San Francisco, CA, Cat. Nos.
27-4949-01 and 27-4951-01) and the ends of the plasmid subclones were sequenced and assembled to generate a
contiguous sequence. This directed sequencing strategy minimizes the redundancy required while allowing one
to scan for and concentrate on interesting regions.

In order to define better the THPO locus and to search for other genes related to the hematopoietin
family, five genomic clones were isolated from this region by PCR screening of human P1 and PAC libraries
(Genome Systems, Inc., Cat. Nos.: P1-2535 and PAC-6539).

The sizes of the genomic fragments are as follows: P1.t is 40 kb; P1.g is 70 kb; P1.u is 70 kb; PAC.z is 200
kb; and BAC.1 is 80 kb. Approximately 75% (140 kb) of the 190 kb genomic DNA region was sequenced by
the Ordered Shotgun Strategy (OSS) (Chen et al., Genomics 17: 651-56 (1993)), and assembled into contigs using
AutoAssembler™ (Applied Biosystems, Perkin Elmer, Foster City, CA, cat. no. 903227). The preliminary
order of these contigs was determined by manual analysis. There were 47 contigs in the 140 kb region. A
PCR-based approach to ordering the contigs and filling in the gaps was employed. The following summarizes the
number and sizes of the gaps. The 50 kb of sequence unique to BAC.1 was sequenced by a total shotgun
approach with a ten-fold redundancy.

Size of gap        Number
<50 bp             13
50-150 bp          7
150-300 bp         7
300-1000 bp        10
1000-5000 bp       7
>5000 bp           2 (15,000 bp)
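
As a consistency check on the gap summary, a linear arrangement of n contigs has n - 1 internal gaps, so 47 contigs imply 46 gaps. The sketch below (illustrative, with hypothetical names; the counts are those tabulated above) sums the tabulated bins and compares the total with that expectation.

```python
# Gap size bins and counts as tabulated in the text.
GAP_BINS = [
    ("<50 bp", 13),
    ("50-150 bp", 7),
    ("150-300 bp", 7),
    ("300-1000 bp", 10),
    ("1000-5000 bp", 7),
    (">5000 bp", 2),
]

N_CONTIGS = 47  # number of contigs reported for the 140 kb region


def total_gaps(bins=GAP_BINS):
    """Sum the gap counts over all size bins."""
    return sum(count for _, count in bins)


def expected_gaps(n_contigs):
    """Internal gaps between n contigs laid end to end in a single linear order."""
    return n_contigs - 1


if __name__ == "__main__":
    print("tabulated gaps:", total_gaps())
    print("expected for", N_CONTIGS, "contigs:", expected_gaps(N_CONTIGS))
```

The tabulated counts sum to 46, which matches the number of gaps expected between 47 linearly ordered contigs.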

DNA sequencing:

ABI DYE-primer™ chemistry (PE Applied Biosystems, Foster City, CA; Cat. No.: 402112) was used
to end-sequence the lambda and plasmid subclones. ABI DYE-terminator™ chemistry (PE Applied Biosystems,
Foster City, CA, Cat. No: 403044) was used to sequence the PCR products with their respective PCR primers.
The sequences were collected with an ABI377 instrument. For PCR products larger than 1 kb, walking primers
were used. The sequences of contigs generated by the OSS strategy in AutoAssembler™ (PE Applied
Biosystems, Foster City, CA; Cat. No: 903227) and the gap-filling sequencing trace files were imported into
Sequencher™ (Gene Codes Corp., Ann Arbor, MI) for overlapping and editing. The sequences generated by
the total shotgun strategy were assembled using Phred and Phrap and edited using Consed
(http://chimera.biotech.washington.edu/uwgc/projects.htm) and GRM (Genome Reconstruction Manager for
Phrap), version 1.2 (http://stork.cellb.bcm.tmc.edu/g1^y).
PCR-Based Gap Filling Strategy:

Primers were designed based on the 5'- and 3'-end sequences of each contig, avoiding repetitive and
low quality sequence regions. All primers were designed to be 19-24-mers with 50-70% G/C content. Oligos
were synthesized and gel-purified by standard methods.
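
The two numeric design rules stated above (19-24-mers, 50-70% G/C content) can be expressed as a small filter. This is a minimal sketch under those two stated constraints only; the function names are hypothetical, and the check does not model the avoidance of repetitive or low-quality regions mentioned in the text.

```python
def gc_fraction(seq):
    """Fraction of G and C bases in a DNA sequence (case-insensitive)."""
    seq = seq.upper()
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def meets_design_rules(seq, min_len=19, max_len=24, min_gc=0.50, max_gc=0.70):
    """Apply the length and G/C-content windows quoted in the text."""
    return (min_len <= len(seq) <= max_len) and (min_gc <= gc_fraction(seq) <= max_gc)


if __name__ == "__main__":
    # Hypothetical candidate sequences, used only to exercise the filter.
    for candidate in ("ATGCATGCATGCATGCATGC", "ATATATATATATATATATAT", "GCGCGCGCGC"):
        print(candidate, meets_design_rules(candidate))
```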

Since the orientation and order of the contigs were unknown, permutations of the primers were used
in the amplification reactions. Two PCR kits were used: first, XL PCR kit (Perkin Elmer, Norwalk, CT; Cat.
No.: N8080205), with extension times of approximately 10 minutes; and second, the Taq polymerase PCR kit
(Qiagen Inc., Valencia, CA; Cat. No.: 201223) was used under high stringency conditions if smeared or multiple
products were observed with the XL PCR kit. The main PCR product from each successful reaction was
extracted from a 0.9% low melting agarose gel and purified with the Geneclean DNA Purification kit prior to
sequencing.
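
The text does not state which primer permutations were actually run. As an illustration of why the number of reactions grows quickly when contig order and orientation are unknown, the sketch below enumerates every pairing of end primers drawn from two different contigs; the function name and the end labels are assumptions made for this example.

```python
from itertools import combinations


def candidate_primer_pairs(contigs):
    """List every unordered pair of end primers taken from two different contigs.

    Each contig contributes one primer at its 5' end and one at its 3' end.
    """
    ends = [(contig, end) for contig in contigs for end in ("5'", "3'")]
    return [(a, b) for a, b in combinations(ends, 2) if a[0] != b[0]]


if __name__ == "__main__":
    pairs = candidate_primer_pairs(["contig_1", "contig_2", "contig_3"])
    print(len(pairs), "candidate pairs for 3 contigs")
```

For n contigs this exhaustive enumeration yields 2n(n - 1) pairs, that is, 4324 for the 47 contigs mentioned earlier, which is why a preliminary manual ordering of contigs is useful before PCR.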
Analysis:

The identification and characterization of coding regions was carried out as follows: First, repetitive
sequences were masked using RepeatMasker (A.F.A. Smit & P. Green,
http://ftp.genome.washington.edu/RM/RM_details.html) which screens DNA sequences in FastA format against
a library of repetitive elements and returns a masked query sequence. Repeats not masked were identified by
comparing the sequence to the GenBank database using WUBLAST2.0 [Altschul, S & Gish, W., Methods
Enzymol. 266: 460-480 (1996); http://blast.wustl.edu/blast/README.html] and were masked manually.

Next, known genes were revealed by comparing the genomic regions against Genentech's protein
database using the WUBLAST2.0 algorithm and then annotated by aligning the genomic and cDNA sequences
for each gene, respectively, using a Needleman-Wunsch (Needleman and Wunsch, J. Mol. Biol. 48: 443-453
(1970)) algorithm to find regions of local identity between sequences. The strategy results in detection of all
exons of the five known genes in the region, THPO, TRAP2, eIF4g, CLCN2 and hRPB17 (see below).

Known genes                                        Map position
eukaryotic translation initiation factor 4 gamma   3q27-qter
thrombopoietin                                     3q26-q27
chloride channel 2                                 3q26-qter
TNF receptor associated protein 2                  not previously mapped
RNA polymerase II subunit hRPB17                   not previously mapped
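
For readers unfamiliar with the Needleman-Wunsch method cited in the annotation step, the following is a minimal sketch of its dynamic-programming recurrence. It computes a global alignment score only; the scoring values (match +1, mismatch -1, linear gap -1) are assumptions chosen for illustration and are not the parameters used in the work described here, which the text does not give.

```python
def needleman_wunsch_score(a, b, match=1, mismatch=-1, gap=-1):
    """Return the optimal global alignment score of sequences a and b.

    Uses the classic Needleman-Wunsch recurrence with a linear gap penalty,
    keeping only two rows of the score matrix in memory.
    """
    cols = len(b) + 1
    # Row 0: aligning an empty prefix of a against prefixes of b costs only gaps.
    prev = [j * gap for j in range(cols)]
    for i in range(1, len(a) + 1):
        curr = [i * gap] + [0] * (cols - 1)
        for j in range(1, cols):
            diagonal = prev[j - 1] + (match if a[i - 1] == b[j - 1] else mismatch)
            curr[j] = max(diagonal, prev[j] + gap, curr[j - 1] + gap)
        prev = curr
    return prev[-1]


if __name__ == "__main__":
    print(needleman_wunsch_score("GATTACA", "GATTACA"))
    print(needleman_wunsch_score("ACGT", "AGT"))
```

Aligning a spliced cDNA to genomic sequence in practice also requires long-gap (intron) handling, which this sketch does not attempt.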



Finally, novel transcription units were predicted using a number of approaches. CpG islands (S. Cross
& Bird, A., Curr. Opin. Genet. Dev. 5: 109-314 (1995)) were used to define promoter regions and were
identified as clusters of sites cleaved by enzymes recognizing GC-rich, 6 or 8-mer palindromic sequences (NotI,
NarI, BssHII, XhoI). CpG islands are usually associated with promoter regions of genes. WUBLAST2.0
analysis of short genomic regions (10-20 kb) versus GenBank revealed matches to ESTs. The individual EST
sequences (or where possible, their sequence chromatogram files) were retrieved and assembled with Sequencher
to provide a theoretical cDNA sequence (DNA36443). GRAIL2 (ApoCom Inc., Knoxville, TN, command line
version for the DEC alpha) was used to predict a novel exon. The five known genes in the region served as
internal controls for the success of the GRAIL algorithm.
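
The restriction-site approach to locating CpG islands described above can be sketched as a motif scan followed by a simple clustering step. The recognition sequences below are the standard ones for the four named enzymes; the window size, the minimum number of sites per cluster, and the function names are assumptions for illustration, since the text does not give the thresholds that were used.

```python
import re

# Standard recognition sequences for the enzymes named in the text.
SITES = {
    "NotI": "GCGGCCGC",
    "NarI": "GGCGCC",
    "BssHII": "GCGCGC",
    "XhoI": "CTCGAG",
}


def find_sites(seq, sites=SITES):
    """Return sorted (0-based position, enzyme) tuples for every site occurrence."""
    seq = seq.upper()
    hits = []
    for name, motif in sites.items():
        # A lookahead pattern also reports overlapping occurrences.
        for match in re.finditer("(?=%s)" % motif, seq):
            hits.append((match.start(), name))
    return sorted(hits)


def cluster_sites(hits, window=1000, min_sites=3):
    """Greedily group hits that start within `window` bp of the first hit in a group.

    Groups with fewer than `min_sites` hits are discarded. Both thresholds are
    illustrative assumptions.
    """
    clusters, current = [], []
    for pos, name in hits:
        if current and pos - current[0][0] > window:
            if len(current) >= min_sites:
                clusters.append(current)
            current = []
        current.append((pos, name))
    if len(current) >= min_sites:
        clusters.append(current)
    return clusters


if __name__ == "__main__":
    # Toy sequence: three sites close together, one isolated site far away.
    toy = ("A" * 10 + "GCGGCCGC" + "A" * 10 + "GGCGCC" + "A" * 10 + "GCGCGC"
           + "A" * 5000 + "CTCGAG")
    for cluster in cluster_sites(find_sites(toy)):
        print(cluster)
```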
Isolation:

A partial endothelin converting enzyme-2 (ECE-2) cDNA clone was isolated by first splicing in silico
the ECE-2 exons predicted in the genomic sequence to generate a putative sequence (DNA36443). An
oligonucleotide probe (GAAGCAGTGCAGCCAGCAGTAGAGAGGCACCTGCTAAGA) (SEQ ID NO:530)
was designed and used to screen a human fetal small intestine library (O^HO) and internal PCR primers
(36443f1) (ECE2.f: ACGCAGCTGGAGCTGGTCTTAGCA) (SEQ ID NO:531) and (36443r1) (ECE2.r)
(GGTACTGGACCCCTAGGGCCACAA) (SEQ ID NO:532) were used to confirm clones hybridizing to the
probe prior to sequencing. One positive clone was obtained; however, this cDNA (DNA49830) represented a
partially spliced transcript containing appropriately spliced exons 1 through 6, followed by intron 6 sequence.
The oligo dT primer annealed to a polyA-stretch within an Alu element present in intron 6. An additional ECE-2
cDNA fragment (DNA49831) was obtained by PCR from a human fetal kidney library (LIB227) with primers
designed from the presumed cDNA sequence [36443f3: CCTCCCAGCCGAGACCAGTGG (SEQ ID NO:533)
and 36443r2: GGTCCTATAAGGGCCAAGACC (SEQ ID NO:534)]. This PCR product extended from exon
13 into the 3' untranslated region in exon 18.
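
The in silico splicing step mentioned at the start of this section amounts to concatenating the predicted exon intervals of the genomic sequence in order. The following is a minimal sketch of that operation; the function name, the 1-based inclusive coordinate convention, and the toy sequence are assumptions, and the actual exon coordinates of ECE-2 are not given in the text.

```python
def splice_exons(genomic, exons):
    """Concatenate exon intervals of a genomic sequence into a putative cDNA.

    `exons` is a list of (start, end) pairs, 1-based and inclusive, which must be
    in ascending order and must not overlap.
    """
    parts = []
    last_end = 0
    for start, end in exons:
        if not (last_end < start <= end <= len(genomic)):
            raise ValueError("exon (%d, %d) is out of order or out of range" % (start, end))
        parts.append(genomic[start - 1:end])
        last_end = end
    return "".join(parts)


if __name__ == "__main__":
    # Toy example: two 6-nt exons (upper case) separated by a 6-nt intron (lower case).
    toy_genomic = "ATGGCC" + "gtaagt" + "TTTAAA"
    print(splice_exons(toy_genomic, [(1, 6), (13, 18)]))
```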

A full length endothelin converting enzyme 2 (ECE-2) cDNA clone (DNA55800-1263) was isolated
from an oligo-dT-primed human fetal brain library. RNA from human fetal brain tissue (20 weeks gestation,
j5283005)(SRC175) was isolated by guanidine thiocyanate and 5 µg used to generate double stranded cDNA
which was cloned into the vector pRK5E. The 3'-primer
(pGACTAGTTCTAGATCGCGAGCGGCCGCCCTTTTTTTTTTTTTTr) (SEQ ID NO:535) and the 5'-linker
(pCGGACGCGTGGGTCGA) (SEQ ID NO:536) were designed to introduce XhoI and NotI restriction sites.
The library was screened with PCR primers [36443pcrf1: CGGCCGTGATGGCTGGTGACG (SEQ ID NO:537)
and 36443r3: GGCAGACTCCTTCCTATGGG (SEQ ID NO:538)] designed from the partial human ECE-2
cDNA sequences (DNA49830 and DNA49831). PCR products were cloned into the vector pCR2.1-TOPO
(Invitrogen Corp., Carlsbad, CA, Cat. No. K4500-01) and sequenced with DYE-terminator chemistry as
described above.
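As a quick sanity check on primer pairs such as those listed above, the reverse complement and GC fraction of each oligonucleotide can be computed. The following is a minimal Python sketch; the helper names are illustrative assumptions and are not part of the described procedure.

```python
# Illustrative helpers (hypothetical names, not from the original procedure).
COMPLEMENT = str.maketrans("ACGTacgt", "TGCAtgca")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA sequence."""
    return seq.translate(COMPLEMENT)[::-1]


def gc_fraction(seq: str) -> float:
    """Return the fraction of G and C bases in a DNA sequence."""
    seq = seq.upper()
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# Primers 36443f3 and 36443r2 as given in the text (SEQ ID NO:533 and 534).
primers = {
    "36443f3": "CCTCCCAGCCGAGACCAGTGG",
    "36443r2": "GGTCCTATAAGGGCCAAGACC",
}

for name, seq in primers.items():
    print(name, len(seq), reverse_complement(seq), round(gc_fraction(seq), 2))
```

The reverse primer is written 5' to 3' on the opposite strand, so its reverse complement is the form that should be found in the sense-strand cDNA sequence.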

EXAMPLE 98: Northern Blot and in situ RNA Hybridization Analysis for PRO403

Expression of PRO403 mRNA in human tissues was examined by Northern blot analysis. Human
polyA+ RNA blots derived from human fetal and adult tissues (Clontech, Palo Alto, CA; Cat. Nos. 7760-1,
7756-1 and 7755-1) were hybridized to [32P-α]dATP-labelled cDNA fragments from a probe based on the full
length PRO403 cDNA. Blots were incubated with the probes in hybridization buffer (5X SSPE; 2X Denhardt's
solution; 100 mg/mL denatured sheared salmon sperm DNA; 50% formamide; 2% SDS) for 18 hours at 42°C,
washed to high stringency (0.1X SSC, 0.1% SDS, 50°C) and autoradiographed. The blots were developed after
overnight exposure by phosphorimager analysis (Fuji).

PRO403 mRNA transcripts were detected. Analysis of the expression pattern showed the strongest
signal of the expected 3.3 kb transcript in adult brain (highest in the cerebellum, pu^^ mitenspoTal
lobe, and lower in the cerebral cortex, occipital lobe and frontal lobe), spinal cord, lung and pancreas and higher
levels of a 4.5 kb transcript in fetal brain and kidney.

EXAMPLE 99: Use of PRO Polypeptide-Encoding Nucleic Acid as Hybridization Probes

The following method describes use of a nucleotide sequence encoding a PRO polypeptide as a
hybridization probe.

DNA comprising the coding sequence of a PRO polypeptide of interest as disclosed herein may be
employed as a probe or used as a basis from which to prepare probes to screen for homologous DNAs (such as
those encoding naturally-occurring variants of the PRO polypeptide) in human tissue cDNA libraries or human
tissue genomic libraries.

Hybridization and washing of filters containing either library DNAs is performed under the following
high stringency conditions. Hybridization of radiolabeled PRO polypeptide-encoding nucleic acid-derived probe
to the filters is performed in a solution of 50% formamide, 5x SSC, 0.1% SDS, 0.1% sodium pyrophosphate,
50 mM sodium phosphate, pH 6.8, 2x Denhardt's solution, and 10% dextran sulfate at 42°C for 20 hours.
Washing of the filters is performed in an aqueous solution of 0.1x SSC and 0.1% SDS at 42°C.
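A wash solution of this kind is commonly made up by diluting concentrated stocks. The sketch below shows the dilution arithmetic (C1 x V1 = C2 x V2) in Python. The stock strengths (20x SSC and 10% SDS), the 1 L final volume, and the function names are assumptions chosen for illustration; none of them is specified in the text.

```python
def stock_volume_ml(stock_conc: float, final_conc: float, final_volume_ml: float) -> float:
    """Volume of stock such that stock_conc * V_stock = final_conc * V_final."""
    if stock_conc <= 0 or final_conc < 0 or final_conc > stock_conc:
        raise ValueError("need stock_conc > 0 and 0 <= final_conc <= stock_conc")
    return final_conc * final_volume_ml / stock_conc


def wash_buffer_recipe(final_volume_ml: float = 1000.0) -> dict:
    """Volumes (mL) for a 0.1x SSC, 0.1% SDS wash; stock strengths are assumed."""
    # Assumed stock: 20x SSC (units are "x", so only the ratio matters).
    ssc = stock_volume_ml(stock_conc=20.0, final_conc=0.1, final_volume_ml=final_volume_ml)
    # Assumed stock: 10% SDS.
    sds = stock_volume_ml(stock_conc=10.0, final_conc=0.1, final_volume_ml=final_volume_ml)
    return {"20x SSC": ssc, "10% SDS": sds, "water": final_volume_ml - ssc - sds}


if __name__ == "__main__":
    for component, volume in wash_buffer_recipe().items():
        print(f"{component}: {volume:.1f} mL")
```

Because both concentrations in each pair share the same unit, the helper works for any consistent unit (x, %, or molar).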

DNAs having a desired sequence identity with the DNA encoding full-length native sequence PRO
polypeptide can then be identified using standard techniques known in the art.

EXAMPLE 100: Expression of PRO Polypeptides in E. coli

This example illustrates preparation of an unglycosylated form of a desired PRO polypeptide by
recombinant expression in E. coli.

The DNA sequence encoding the desired PRO polypeptide is initially amplified using selected PCR
primers. The primers should contain restriction enzyme sites which correspond to the restriction enzyme sites
on the selected expression vector. A variety of expression vectors may be employed. An example of a suitable
vector is pBR322 (derived from E. coli; see Bolivar et al., Gene, 2:95 (1977)) which contains genes for
ampicillin and tetracycline resistance. The vector is digested with restriction enzyme and dephosphorylated.
The PCR amplified sequences are then ligated into the vector. The vector will preferably include sequences
which encode for an antibiotic resistance gene, a trp promoter, a polyhis leader (including the first six STII
codons, polyhis sequence, and enterokinase cleavage site), the specific PRO polypeptide coding region, lambda
transcriptional terminator, and an argU gene.

The ligation mixture is then used to transform a selected E. coli strain using the methods described in
Sambrook et al., supra. Transformants are identified by their ability to grow on LB plates and antibiotic resistant
colonies are then selected. Plasmid DNA can be isolated and confirmed by restriction analysis and DNA
sequencing.

Selected clones can be grown overnight in liquid culture medium such as LB broth supplemented with
antibiotics. The overnight culture may subsequently be used to inoculate a larger scale culture. The cells are
then grown to a desired optical density, during which the expression promoter is turned on.

After culturing the cells for several more hours, the cells can be harvested by centrifugation. The cell
pellet obtained by the centrifugation can be solubilized using various agents known in the art, and the solubilized
PRO polypeptide can then be purified using a metal chelating column under conditions that allow tight binding
of the protein.

PRO181, PRO195, PRO200, PRO237, PRO273, PRO540, PRO322, PRO1017, PRO938, PRO162,
PRO1114, PRO827 and PRO1008 were expressed in E. coli in a poly-His tagged form, using the following
procedure. The DNA encoding the PRO polypeptide was initially amplified using selected PCR primers. The
primers contained restriction enzyme sites which correspond to the restriction enzyme sites on the selected
expression vector, and other useful sequences providing for efficient and reliable translation initiation, rapid
purification on a metal chelation column, and proteolytic removal with enterokinase. The PCR-amplified,
poly-His tagged sequences were then ligated into an expression vector, which was used to transform an E. coli host
based on strain 52 (W3110 fuhA(tonA) lon gaffiipoHtsO^^ Transformants were first grown
in LB containing 50 mg/ml carbenicillin at 30°C with shaking until an O.D.6OO0 Cultures
were then diluted 50-100 fold into CRAP media (prepared by mixing 3.57 g (NH4)2SO4, 0.71 g sodium
citrate·2H2O, 1.07 g KCl, 5.36 g Difco yeast extract, 5.36 g Sheffield hycase SF in 500 mL water, as well as
110 mM MPOS, pH 7.3, 0.55% (w/v) glucose and 7 mM MgSO4) and grown for approximately 20-30 hours
at 30°C with shaking. Samples were removed to verify expression by SDS-PAGE analysis, and the bulk culture
was centrifuged to pellet the cells. Cell pellets were frozen until purification and refolding.

E. coli paste from 0.5 to 1 L fermentations (6-10 g pellets) was resuspended in 10 volumes C^^
|: M guanidine, 20 mM Tris, pH 8 buffer. Solid sodium sulfite and sodium tetrathionate were added to make final
concentrations of 0.1 M and 0.02 M, respectively, and the solution was stirred overnight at 4°C. This step
results in a denatured protein with all cysteine residues blocked by sulfitolization. The solution was centrifuged
at 40,000 rpm in a Beckman Ultracentrifuge for 30 min. The supernatant was diluted with 3-5 volumes of metal
chelate column buffer (6 M guanidine, 20 mM Tris, pH 7.4) and filtered through 0.22 micron filters to clarify.
The clarified extract was loaded onto a 5 ml Qiagen Ni-NTA metal chelate column equilibrated in the
metal chelate column buffer. The column was washed with additional buffer containing 50 mM imidazole
(Calbiochem, Utrol grade), pH 7.4. The protein was eluted with buffer containing 250 mM imidazole.
Fractions containing the desired protein were pooled and stored at 4°C. Protein concentration was estimated
by its absorbance at 280 nm using the calculated extinction coefficient based on its amino acid sequence.
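The text does not state how the extinction coefficient was calculated. One widely used approximation sums fixed contributions from tryptophan, tyrosine and cystine residues, and the Beer-Lambert law then converts absorbance to concentration. The Python sketch below assumes that approximation (5500, 1490 and 125 M^-1 cm^-1 per Trp, Tyr and cystine, respectively); the function names, the toy sequence and the example molecular weight are hypothetical.

```python
def molar_extinction_280(seq: str, n_cystine: int = 0) -> float:
    """Approximate molar extinction coefficient at 280 nm (M^-1 cm^-1).

    Assumed per-residue contributions: Trp 5500, Tyr 1490, cystine 125.
    n_cystine defaults to 0, as for a protein whose cysteines are blocked.
    """
    seq = seq.upper()
    return 5500 * seq.count("W") + 1490 * seq.count("Y") + 125 * n_cystine


def concentration_mg_per_ml(a280: float, seq: str, mol_weight_da: float,
                            path_cm: float = 1.0, n_cystine: int = 0) -> float:
    """Beer-Lambert estimate: c = A / (epsilon * l), converted to mg/mL."""
    eps = molar_extinction_280(seq, n_cystine)
    if eps == 0:
        raise ValueError("sequence has no Trp, Tyr or cystine; A280 is not usable")
    molar = a280 / (eps * path_cm)   # mol/L
    return molar * mol_weight_da     # (mol/L) * (g/mol) = g/L = mg/mL


if __name__ == "__main__":
    toy_seq = "MKWVTFYSLLW"          # hypothetical sequence, for illustration only
    print(molar_extinction_280(toy_seq))
    print(concentration_mg_per_ml(0.5, toy_seq, mol_weight_da=25000.0))
```

The molecular weight is passed in explicitly because computing it from sequence requires a residue-mass table that is outside the scope of this sketch.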

The proteins were refolded by diluting sample slowly into freshly prepared refolding buffer consisting
of: 20 mM Tris, pH 8.6, 0.3 M NaCl, 2.5 M urea, 5 mM cysteine, 20 mM glycine and 1 mM EDTA.
Refolding volumes were chosen so that the final protein concentration was between 50 and 100 micrograms/ml.
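The refolding volume follows directly from the amount of protein and the target concentration window. A minimal Python sketch of that arithmetic is given below; the function names are illustrative, and the calculation treats the result as the total final volume (sample plus refolding buffer).

```python
def refolding_volume_ml(protein_mg: float, target_ug_per_ml: float) -> float:
    """Total final volume (mL) that gives the target protein concentration."""
    if protein_mg <= 0:
        raise ValueError("protein_mg must be positive")
    if not 50.0 <= target_ug_per_ml <= 100.0:
        raise ValueError("target must lie within the 50-100 micrograms/ml window")
    return protein_mg * 1000.0 / target_ug_per_ml  # mg -> micrograms, then divide


def refolding_volume_range_ml(protein_mg: float) -> tuple:
    """(minimum, maximum) final volume for the 100 and 50 micrograms/ml limits."""
    return (refolding_volume_ml(protein_mg, 100.0),
            refolding_volume_ml(protein_mg, 50.0))


if __name__ == "__main__":
    low, high = refolding_volume_range_ml(10.0)  # 10 mg is an arbitrary example
    print(f"Refold 10 mg in {low:.0f} to {high:.0f} mL")
```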

The refolding solution was stirred gently at 4°C for 12-36 hours. The refolding reaction was quenched by the
addition of TFA to a final concentration of 0.4% (pH of approximately 3). Before further purification of the
protein, the solution was filtered through a 0.22 micron filter and acetonitrile was added to 2-10% final
concentration. The refolded protein was chromatographed on a Poros R1/H reversed phase column using a






FIGURE 89



></usr/seqdb2/sst/DNA/Dnaseqs.min/ss.DNA50919
><subunit 1 of 1, 472 aa, 1 stop
><MW: 53847, pI: 5.75, NX(S/T): 2

MSNiyiQEPPTNGKVLLKTTAGDIDIELWSKEAPKACRNFIQLCLEAYYDNTIFHRWPGFI 
VQGGDPTGTGSGGESIYGAPFKDEFHSRLRFinyRGLVAMaNAGSHDNGSQFFFTI.GRADELN 
NKHTIFGKVTGDTVYNMLRLSEVDIDDDERPHNPHKIKSCEVLFNPFDDIIPKEIKRLKKEK 
PEEEVKKLKPKGTKNFSLLSFGEEAEEEEEEWRVSQSMKGKSKSSHDLLKDDPHLSSVPW 
ESEKGDAPDLVDDGEDESAEHDEYIDGDEKNLMRERIAKKLKKDTSANVKSAGEGEVEKKSV 
SRSEELRKEARQLKRELLAAKQKKVENAAKQAEKRSEEEEAPPDGAVAEYRREKQKYEALRK 
QQSKKGTSREDQTLALLNQFKSKLTQAIAETPENDIPETEVEDDEGWMSHVLQFEDKSRKVK 
DASMQDSDTFEIYDPRNPVNKRRREESKKLMREKKEKR 

Important features:
Signal peptide:
amino acids 1-21

N-glycosylation sites:
amino acids 109-112 and 201-204

Cyclophilin-type peptidyl-prolyl cis-trans isomerase signature:
amino acids 49-66

Homologous region to Cyclophilin-type peptidyl-prolyl cis-trans isomerase:
amino acids 96-140, 49-89 and 22-51

CGCCGCCGTTGGGGCTGGAAGTTCCCGCCAGGTCCGTGCCGGGCGAGAGAGATGCTGCCCGG

CCCGCCTCGGCTTTGAGGCGAGAGAAGTGTCCCAGACCCATTTCGCCTTGCTGACGGCGTCG 

AGCCCTGGCCAGACATGTCCACAGGGTTCTCCTTCGGGTCCGGGACTCTGGGCTCCACCACC 

GTGGCCGCCGGCGGGACCAGCACAGGCGGCGTTTTCTCCTTCGGAACGGGAACGTCTAGCAA 

CCCTTCTGTGGGGCTCAATTTTGGAAATCTTGGAAGTACTTCAACTCCAGCAACTACATCTG 

CTCCTTCAAGTGGTTTTGGAACCGGGCTCTTTGGATCTAAACCTGCCACTGGGTTCACTCTA 

GGAGGAACAAATACAGGTGCCTTGCACACCAAGAGGCCTCAAGTGGTCACCAAATATGGAAC 

CCTGCAAGGAAAACAGATGCATGTGGGGAAGACACCCATCCAAGTCTTTTTAGGAGTCCCCT 

TCTCCAGACCTCCTCTAGGTATCCTCAGGTTTGCACCTCCAGAACCCCCGGAGCCCTGGAAA 

GGAATCAGAGATGCTACCACCTACCCGCCTGGATGGAGTCTCGCTCTGTCGCCAGGCTGGAG 

TGCAGTGGCACGATCTCGGCTCACTGCAACCTCCGCCTCCCGGGTTCAAGCGAGTCTCCTGC 

CTCAGCCTCTGAGTGTCTGGGGCTACAGGTGCCTGCAGGAGTCCTGGGGCCAGCTGGCCTCG 

ATGTACGTCAGCACGCGGGAACGGTACAAGTGGCTGCGCTTCAGCGAGGACTGTCTGTACCT 

GAACGTGTACGCGCCGGCGCGCGCGCCCGGGGATCCCCAGCTGCCAGTGATGGTCTGGTTCC 

CGGGAGGCGCCTTCATCGTGGGCGCTGCTTCTTCGTACGAGGGCTCTGACTTGGCCGCCCGC 

GAGAAAGTGGTGCTGGTGTTTCTGCAGCACAGGCTCGGCATCTTCGGCTTCCTGAGCACGGA 

CGACAGCCACGCGCGCGGGAACTGGGGGCTGCTGGACCAGATGGCGGCTCTGCGCTGGGTGC 

AGGAGAACATCGCAGCCTTCGGGGGAGACCCAGGAAATGTGACCCTGTTCGGCCAGTCGGCG 

GGGGCCATGAGCATCTCAGGACTGATGATGTCACCCCTAGCCTCGGGTCTCTTCCATCGGGC 

CATTTCCCAGAGTGGCACCGCGTTATTCAGACTTTTCATCACTAGTAACCCACTGAAAGTGG 

CCAAGAAGGTTGCCCACCTGGCTGGATGCAACCACAACAGCACACAGATCCTGGTAAACTGC 

CTGAGGGCACTATCAGGGACCAAGGTGATGCGTGTGTCCAACAAGATGAGATTCCTCCAACT 

GAACTTCCAGAGAGACCCGGAAGAGATTATCTGGTCCATGAGCCCTGTGGTGGATGGTGTGG 

TGATCCCAGATGACCCTTTGGTGCTCCTGACCCAGGGGAAGGTTTCATCTGTGCCCTACCTT 

CTAGGTGTCAACAACCTGGAATTCAATTGGCTCTTGCCTTAT.?iATATCACCAAGGAGCAGGT 

ACCACTTGTGGTGGAGGAGTACCTGGAC2^TGTCAATGAGCATGACTGGAAGATGCTACGAA 

ACCGTATGATGGACATAGTTCAAGATGCCACTTTCGTGTATGCCACACTGCAGACTGCTCAC 

TACCACCGAGAAACCCCAATGATGGGAATCTGCCCTGCTGGCCACGCTACAACAAGGATGAA 

AAGTACCTGCAGCTGGATTTTACCACAAGAGTGGGCAIGAAGCTCAAGGAGAAGAAGATGGC 

TTTTTGGATGAGTCTGTACCAGTCTCAAAGACCTGAGAAGCAGAGGCAATTCTAAGGGTGGC 

TATGCAGGAAGGAGCCAAAGAGGGGTTTGCCCCCACCATCCAGGCCCTGGGGAGACTAGCCA 

TGGACATACCTGGGGACAAGAGTTCTACCCACCCCAGTTTAGAACTGCAGGAGCTCCCTGCT 

GCCTCCAGGCCAAAGCTAGAGCTTTTGCCTGTTGTGTGGGACCTGCACTGCCCTTTCCAGCC 

TGACATCCCATGATGCCCCTCTACTTCACTGTTGACATCCAGTTAGGCCAGGCCCTCTCAAC 

CCCACCCTTCAATGTCTCCTTGTGACTCCTTCTTATGGGAGGTCGACCCAGACTGCCACTGC 
CCCTGTCACTGCACCCAGCTTGGCATTTACCATCCATCCTGCTCAACCTTGTTCCTGTCTGT 
TCACATTGGCCTGGAGGCCTAGGGCAGGTTGTGACATGGAGCAAACTTTTGGTAGTTTGGGA 
TCTTCTCTCCCACCCACACTTATCTCCCCCAGGGCCACTCCAAAGTCTATACACAGGGGTGG 
TCTCTTCAATAAAGAAGTGTTGATTAGAAAAA?\AAAA?l

></usr/seqdb2/sst/DNA/Dnaseqs.min/ss.DNA44179
><subunit 1 of 1, 545 aa, 1 stop
><MW: 58934, pI: 9.45, NX(S/T): 4

MSTGFSFGSGTLGSTTVAAGGTSTGGVFSFGTGTSSNPSVGLNFGNLGSTSTPATTSl^SSG 
FGTGLFGSKPATGFTLGGTNTGALHTKRPQWTKYGTLQGKQMHVGKTPIQVFLGVPFSRPP 
LGILRFAPPEPPEPWKGIRDATTYPPGWSLALSPGWSAVARSRLTATSASRVQASLLPQPLS 
WGYRCLQESWGQLASMYVSTRERYICWLRFSEDCLYLNVYAPARAPGDPQLPVMVWFPGGAF 
IVGAASSYEGSDIAAREKVVLVFLQHRLGIFGFLSTDDSHARGmGLLDQMAALRWVQENIA 
AFGGDPGim'LFGQSAGAMSISGLmSPLASGLFHRAISQSGTALFRLFITSNPLKVAKKVA 
HLAGCNHNSTQILWCLRALSGTKVMRVSNKMRFLQLNFQRDPEEIIWSMSPVVDGVVIPDD 
PLVLLTQGKVSSVPYLLGVNNLEFNWLLPYNITKEQVPLWEEYLDNVNEHDWKMLRNRMMD 
IVQDATFVYATLQTAHYHRETPMMGICPAGHATTRMKSTCSWILPQEWA 

Important features: 
Signal peptide: 

amino acids 1-29 

Carboxylesterases type-B serine active site, 
amino acids 312-327 

Carboxylesterases type-B signature 2. 

amino acids 218-228 

N-glycosylation sites. 

amino acids 318-321, 380-383 and 465-468

GAGAACAGGCCTGTCTCAGGCAGGCCCTGCGCCTCCTATGCGGAGATGCTACTGCCACTGCT
GCTGTCCTCGCTGCTGGGCGGGTCCCAGGCTATGGATGGGAGATTCTGGATACGAGTGCAGG 
AGTCAGTGATGGTGCCGGAGGGCCTGTGCATCTCTGTGCCCTGCTCTTTCTCCTACCCCCGA 
CAAGACTGGACAGGGTCTACCCCAGCTTATGGCTACTGGTTCAAAGCAGTGACTGAGACAAC 
CAAGGGTGCTCCTGTGGCCACAAACCACCAGAGTCGAGAGGTGGAAATGAGCACCCGGGGCC 
GATTCCAGCTCACTGGGGATCCCGCCAAGGGGAACTGCTCCTTGGTGATCAGAGACGCGCAG 
ATGCAGGAT6AGTCACAGTACTTCTTTCGGGTGGAGAGAGGAAGCTATGTGACATATAATTT 
CATGAACGATGGGTTCTTTCTAAAAGTAACAGTGCTCAGCTTCACGCCCAGACCCCAGGACC 
ACAACACCGACCTCACCTGCCATGTGGACTTCTCCAGJy^GGGTGTGAGCGCACAGAGGACC 
GTCCGACTCCGTGTGGCCTATGCCCCCAGAGACCTTGTTATCAGCATTTCACGTGACAACAC 
GCCAGCCCTGGAGCCCCAGCCCCAGGGAAATGTCCCATACCTGGAAGCCCAAAAAGGCCAGT 
TCCTGCGGCTCCTCTGTGCTGCTGACAGCCAGCCCCCTGCCACACTGAGCTGGGTCCTGCAG 
AACAGAGTCCTCTCCTCGTCCCATCCCTGGGGCCCTAGACCCCTGGGGCTGGAGCTGCCCGG 
GGTGAAGGCTGGGGATTCAGGGCGCTACACCTGCCGAGCGGAGAACAGGCTTGGCTCCCAGC 
AGCGAGCCCTGGACCTCTCTGTGCAGTATCCTCCAGAGAACCTGAGAGTGATGGTTTCCCAA 
GCAAACAGGACAGTCCTGGAAAACCTTGGGAACGGCACGTCTCTCCCAGTACTGGAGG6CCA 
AAGCCTGTGCCTGGTCTGTGTCACACACAGCAGCCCCCCAGCCAGGCTGAGCTGGACCCAGA 
GGGGACAGGTTCTGAGCCCCTCCCAGCCCTCAGACCCCGGGGTCCTGGAGCTGCCTCGGGTT 
CAAGTGGAGCACGAAGGAGAGTTCACCTGCCACGCTCGGCACCCACTGGGCTCCCAGCACGT 
CTCTCTCAGCCTCTCCGTGCACTATAAGAAGGGACTCATCTCAACGGCATTCTCCAACGGAG 
CGTTTCTGGGAATCGGCATCACGGCTCTTCTTTTCCTCTGCCTGGCCCTGATCATCATGAAG 
ATTCTACCGAAGAGACGGACTCAGACAGAAACCCCGAGGCCCAGGTTCTCCCGGCACAGCAC 
GATCCTGGATTACATCAATGTGGTCCCGACGGCTGGCCCCCTGGCTCAGAAGCGGAATCAGA 
AAGCCACACCAAACAGTCCTCGGACCCCTCCTCCACCAGGTGCTCCCTCCCCAGl^TCAAAG 
AAGAACCAGAAAAAGCAGTATCA6TTGCCCAGTTTCCCAGAACCCAAATCATCCACTCAAGC 
CCCAGAATCCCAGGAGAGCCAAGAGGAGCTCCATTATGCCACGCTCAACTTCCCAGGCGTCA 
GACCCAGGCCTGAGGCCCGGATGCCCAAGGGCACCCAGGCGGATTATGCAGAAGTCAAGTTC 
CAATGAGGGTCTCTTAGGCTTTAGGACTGGGACTTCGGCTAGGGAGGAAGGTAGAGTAAGAG 
GTTGAAGATAACAGAGTGCAAAGTTTCCTTCTCTCCCTCTCTCTCTCTCTTTCTCTCTCTCT 
CTCTCTTTCTCTCTCTTTTAAAAZ^CATCTGGCCAGGGC^CAG 

CCAGCACTTTGGGAGGTTGAGGTGGGCAGATCGCCTGAGGTCGGGAGTTCGAGACCAGCCTG 
GCCAACTTGGTGAAACCCCGTCTCTACTAAiyiATACAAAAATTAGCTGGGCATGGTGGCAGG 
CGCCTGTAATCCTACCTACTTGGGAAGCTGAGGCAGGAGAATCACTTGAACCTGGGAGACGG 
AGGTTGCAGTGAGCCAAGATCACACCATTGCACGCCAGCCTGGGCAACAAAGCGAGACTCCA 
TCTCAAAAAAAAAATCCTCCAAATGGGTTGGGTGTCTGTAATCCCAGCACTTTGGGAGGCTA 
AGGTGGGTGGATTGCTTGAGCCCAGGAGTTCGAGACCAGCCTGGGCAACATGGTGAAACCCC 
ATCTCTACAAAAAATACAAAACATAGCTGGGCTTGGTGGTGTGTGCCTGTAGTCCCAGCTGT 
CAGACATTTAAACCAGAGCAACTCCATCTGGAATAGGAGCTGAATAAAATGAGGCTGAGACC 
TACTGGGCTGCATTCTCAGACAGTGGAGGCATTCTAAGTCACAGGATGAGACAGGAGGTCCG 
TACAAGATACAGGTCATAAAGACTTTGCTGATAAAACAGATTGCAGTAAAGAAGCCAACCAA 
ATCCCACCAAAACCAAGTTGGCCACGAGAGTGACCTCTGGTCGTCCTCACTGCTACACTCCT 
GACAGCACCATQACAGTTTACAAATGCCATG6CAACATCAGGAAGTTACCCGATATGTCCCA 
AAAGGGGGAGGl^TGAATAATCCACCCCTTGTTTAGCJiAATAAGCAAGAAATAACCATAAAA 
GTGGGCAACCAGCAGCTCTAGGCGCTGCTCTTGTCTATGGAGTAGCCATTCTTTTGTTCCTT 
TACTTTCTTAATAAACTTGCTTTCACCTTAAAAAAA

></usr/seqdb2/sst/DNA/Dnaseqs.min/ss.DNA54002
><subunit 1 of 1, 544 aa, 1 stop
><MW: 60268, pI: 9.53, NX(S/T): 3

MLLPLLLSSLLGGSQAMDGRFWIRVQESVMVPEGLCISVPCSFSYPRQDWTGSTPAYGYWFK 
AVTETTKGAPVATNHQSREVEMSTRGRFQLTGDPAKGNCSLVIRDAQMQDESQYFFRVERGS 
YVTYNFMNDGFFLKVTVLSFTPRPQDHNTDLTCHVDFSRKGVSAQRTVRLRVAYAPRDLVIS 
I SRDNTPALE PQPQGNVPYLEAQKGQFLRLLCAADSQPPATLS WVLQNRVLSSSHPWGPRPL 
GLELPGVKAGDSGRYTCRAENRLGSQQRALDLSVQYPPENLRVMVSQANRTVLENLGNGTSL 
PVLEGQSLCLVCVTHSSPPARLSWTQRGQVLSPSQPSDPGVLELPRVQVEHEGEFTCHARHP 
LGSQHVSLSLSVHYKKGLISTAFSNGAFLGIGITALLFLCLALIIMKILPKRRTQTETPRPR 
FSRHSTILDYINWPTAGPLAQKRNQKATPNSPRTPPPPGAPSPESKKNQKKQYQLPSFPEP 
KSSTQAPESQESQEELHYATLNFPGVRPRPEARMPKGTQADYAEVKPQ 

Important features:
Signal peptide:
amino acids 1-15

Transmembrane domain:
amino acids 399-418

N-glycosylation sites:
amino acids 100-103, 297-300 and 306-309

Immunoglobulins and major histocompatibility complex proteins signature:
amino acids 365-371

TGAAGAGTAATAGTTGGAATCAAAAGAGTCAACGCAATGAACTGTTATTTACTGCTGCGTTT
TATGTTGGGAATTCCTCTCCTATGGCCTTGTCTTGGAGCAACAGAAAACTCTCAAACAAAGA 
AAGTCAAGCAGCCAGTGCGATCTCATTTGAGAGTGAAGCGTGGCTGGGTGTGGAACCAATTT 
TTTGTACCAGAGGAAATGAATACGACTAGTCATCACATCGGCCAGCTAAGATCTGATTTAGA 
CAATGGAAACAATTCTTTCCAGTACAAGCTTTTGGGAGCTGGAGCTGGAAGTACTTTTATCA 
TTGATGAAAGAACAGGT6ACATATATGCCATACAGAAGCTTGATAGAGAGGAGCGATCCCTC 
TACATCTTAAGAGCCCAGGTAATAGACATCGCTACTGGAAGGGCTGTGGAACCTGAGTCTGA 
GTTTGTCATCAAAGTTTCGGATATCAATGACAATGAACCAAAATTCCTAGATGAACCTTAT6 
AGGCCATTGTACCAGAGATGTCTCCAGAAGGAACATTAGTTATCCAGGTGACAGCAAGTGAT 
GCTGACGATCCCTCAAGTGGTAATAATGCTCGTCTCCTCTACAGCTTACTTCAAGGCCAGCC 
ATATTTTTCTGTTGAACCAACAACAGGAGTCATAAGAATATCTTCTAAAATGGATAGAGAAC 
TGCAAGATGAGTATTGGGTAATCATTCAAGCCAAGGACATGATTGGTCAGCCAGGAGCGTTG 
TCTGGAACAACAAGTGTATTAATTAAACTTTCAGATGTTAATGACAATAAGCCTATATTTAA 
AGAAAGTTTATACCGCTTGACTGTCTCTGAATCTGCACCCACTGGGACTTCTATAGGAACAA 
TCATGGCATATGATAATGACATAGGAGAGAATGCAGAAATGGATTACAGCATTGAAGAGGAT 
GATTCGCAAACATTTGACATTATTACTAATCATGAAACTCAAGAAGGAATAGTTATATTAAA 
AAAGAAAGTGGATTTTGAGCACCAGAACCACTACGGTATTAGAGCAAAAGTTAAAAACCATC 
ATGTTCCTGAGCAGCTCATGAAGTACCACACTGAGGCTTCCACCACTTTCATTAAGATCCAG 
GTGGAAGATGTTGATGAGCCTCCTCTTTTCCTCCTTCCATATTATGTATTTGAAGTTTTTGA 
AGAAACCCCACAGGGATCATTTGTAGGCGTGGTGTCTGCCACAGACCCAGACAATAGGAAAT 
CTCCTATCAGGTATTCTATTACTAGGAGCAAAGTGTTCAATATCAATGATAATGGTACAATC 
ACTACAAGTAACTCACTGGATCGTGAAATCAGTGCTTGGTACAACCTAAGTATTACAGCCAC 
AGAAAAATACAATATAGAACAGATCTCTTCGATCCCACTGTATGTGCAAGTTCTTAACATCA 
ATGATCATGCTCCTGAGTTCTCTCAATACTATGAGACTTATGTTTGTGAAAATGCAGGCTCT 
GGTCAGGTAATTCAGACTATCAGTGCAGTGGATAGAGATGAATCCATAGAAGAGCACCATTT 
TTACTTTAATCTATCTGTAGAAGACACTAACAATTCAAGTTTTACAATCATAGATAATCAAG 
ATAACACAGCTGTCATTTTGACTAATAGAACTGGTTTTAACCTTCAAGAAGliACCTGTCTTC 
TACATCTCCATCTTAATTGCCGACAATGGAATCCCGTCACTTACAAGTACAAACACCCTTAC 
CATCCATGTCTGTGACTGTGGTGACAGTGGGAGCACACAGACCTGCCAGTACCAGGAGCTTG 
TGCTTTCCATGGGATTCAAGACAGaAGTTATCATTGCTATTCTCATTTGCATTATGATCATA 
TTTGGGTTTATTTTTTTGACTTTGGGTTTAAAACZ^CGGAGAAAAO^GATTCTATTTCCTGA 
GAAl^GTGAAGATTTCAGAGAGAATATATTCCAATATGATGATGAAGGGGGTGGAGAAGAAG 
ATACAGAGGCCTTTGATATAGCAGAGCTGAGGAGTAGTACCATAATGCGGGAACGCAAGACT 
CGGAAAACCACAAGCGCTGAGATCAGGAGCCTATACAGGCAGTCTTTGCAAGTTGGCCCCGA 
CAGTGCCATATTCAGGAAATTCATTCTGGAAAAGCTCGAAGAAGCTAATACTGATCCGTGTG 
CCCCTCCTTTTGATTCCCTCCAGACCTACGCTTTTGAGGGAACAGGGTCATTAGCTGGATCC 
CTGAGCTCCTTAGAATCAGCAGTCTCTGATCAGGATGAAAGCTATGATTACCTTAATGAGTT 
GGGACCTCGCTTTAAAAGATTAGCATGCATGTTTGGTTCTGCAGTGCAGTCAAATAA TTAGG 
GCTTTTTACCATCAAAATTTTTAAAAGTGCTAATGTGTATTCGAACCCAATGGTAGTCTTAA 
AGAGTTTTGTGCCCTGGCTCTATGGCG6GGAAAGCCCTAGTCTATGGAGTTTTCTGATTTCC 
CTGGAGTAAATACTCCATGGTTATTTTAAGCTACCTACATGCTGTCATTGAACAGAGATGTG 
GGGAGAAATGTAAACAATO^GCTCACAGGCATCAATACaACCAGATTTGAAGTAAAATAATG 
TAGGAAGATATTAAAAGTAGATGAGAGGACACAAGATGTAGTCGATCCTTATGCGATTATAT 
CATTATTTACTTAGGAAAGAGTAAAAATACCAAACGAGAZyyVTTTAAAGGAGCAAAAATTTG 
CAAGTCAAATAGAAATGTACAAATCGAGATAACATTTACATTTCTATCATATTGACATGAAA 
ATTGAAAATGTATAGTCAGAGAAATTTTCATGAATTATTCCATGAAGTATTGTTTCCTTTAT 
TTAAA 



FIGURE 95



></usr/seqdb2/sst/DNA/Dnaseqs.min/ss.DNA53906
><subunit 1 of 1, 772 aa, 1 stop
><MW: 87002, pI: 4.64, NX(S/T): 8

MNCYLLLRFMLGIPLLWPCLGATENSQTKKVKQPWSHLRVKRGWVPmQFFVPEEMNT^ 

IGQLRSDLDNGNNSFQYKLLGAGAGSTFIIDERTGDIYAIQKLDREERSLYILRAQVIDIAT 

GRAVEPESEFVIKVSDINDNEPKFLDEPYEAIVPEMSPEGTLVIQVTASDADDPSS6NNARL 

LYSLLQGQPYFSVEPTTGVIRISSKMDRELQDEYWVIIQAKDMIGQPGALSGTTSVLIKLSD 

VNDNKPIFKESLYRLTVSESAPTGTSIGTIMAYDNDIGENAEMDYSIEEDDSQTFDIITNHE 

TQEGIVILKKKVDFEHQimYGIRAiWKNHHVPEQLMKYHTEASTTFIKIQVEDVDEPPIiFLL 

PYYVFEVFEETPQGSFVGWSATDPDNRKSPIRYSITRSKVFNINDNGTITTSNSLDREISA 

TmJLSITATEKYNIEQISSIPLYVQVLNINDHAPEFSQYYETYVCENAGSGQVIQTISAVDR 

DESIEEHHFYFNLSVEDTNNSSFTIIDNQDNTAVILTNRTGFNLQEEPVFYISILIADNGIP 

SLTSTNTLTIHVCDCGDSGSTQTCQYQELVLSMGFKTEVIIAILICIMIIFGFIFLTLGLKQ 

RRKQILFPEKSEDFRENIFQYDDEGGGEEDTEAFDIAELRSSTIMRERKTRKTTSAEIRSLY 

RQSLQVGPDSAIFRKFILEKLEEANTDPCAPPFDSLQTYAFEGTGSLAGSLSSLESAVSDQD 

ESYDYLNELGPRFKRLACMFGSAVQSNN 

Important features:
Signal peptide: 

amino acids 1-21 

Transmembrane domain: 
amino acids 597-617 

N-glycosylation sites. 

amino acids 57-60, 74-77, 419-423, 437-440, 508-511, 515-518, 
516-519 and 534-537 

Cadherins extracellular repeated domain signature, 
amino acids 136-146 and 244-254 
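The four-residue ranges listed for N-glycosylation sites are consistent with a PROSITE-style sequon pattern, N-{P}-[ST]-{P} (asparagine, any residue except proline, serine or threonine, any residue except proline). The Python sketch below scans a protein sequence for that pattern and reports 1-based ranges in the same style. That this pattern was the one used to annotate the figures is an assumption, and the function name and toy sequence are hypothetical.

```python
import re

# Lookahead so that overlapping sequons (e.g. N-N-T-S-...) are all reported.
SEQUON = re.compile(r"(?=N[^P][ST][^P])")


def find_n_glycosylation_sites(protein: str) -> list:
    """Return 1-based (start, end) ranges of N-{P}-[ST]-{P} matches."""
    protein = protein.upper()
    return [(m.start() + 1, m.start() + 4) for m in SEQUON.finditer(protein)]


if __name__ == "__main__":
    toy = "MKNGTAVLNPSANSSQ"  # hypothetical sequence, for illustration only
    for start, end in find_n_glycosylation_sites(toy):
        print(f"amino acids {start}-{end}: {toy[start - 1:end]}")
```

Because the pattern requires a fourth residue, a sequon at the extreme C-terminus (N-X-S/T with nothing after it) is not reported by this sketch.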



FIGURE 96 



ATTTCAAGGCCAGCCATATTTTTNTGTTGAACCAACAACAGGAGTCATAAGAATATTTTNTA 
AAATGGATAGAGAACTGCAAGATGAGTATTGGGTAATCATTCAAGCCAAGGACATGATTGGT 
CAGCCAGGAGCGTTGTlTrGGAACAAa^GTGTATTAATTAAACTTTCAGATGTTAATGACAA 
TAAGCCTATATTTAAAGAAAGTTTATACCGCTTGACTGTNTNTGAATCTGCACCCACTGGGA 
NTTNTATAGGAACAATCATGGCATATGATAATGACATAGGAGAGAATGCAGAAATGGATTAC 
AGCATTGAAGAGGATGATTCGCAAACATTTGACATTATT 



FIGURE 97 



GCAACCTCAGCTTCTAGTATCCAGACTCCAGCGCCGCCCCGGGCGCGGACCCCAACCCCGAC 

CCAGAGCTTCTCCAGCGGCGGCGCAGCGAGCAGGGCTCCCCGCCTTAACTTCCTCCGCGGGG 

CCCAGCCACCTTCGGGAGTCCGGGTTGCCCACCTGCAAACTCTCCGCCTTCTGCACCTGCCA 

CCCCTGAGCCAGCGCGGGCCCCCGAGCGAGTCATGGCCAACGCGGGGCTGCAGCTGTTGGGC 

TTCATTCTCGCCTTCCTGGGATGGATCGGCGCCATCGTCAGCACTGCCCTGCCCCAGTGGAG 

GATTTACTCCTATGCCGGCGACAACATCGTGACCGCCCAGGCCATGTACQAGGGGCTGTGGA 

TGTCCTGCGTGTCGCAGAGCACCGGGCAGATCCAGTGCAAAGTCTTTGACTCCTTGCTGAAT 

CTGAGCAGCACATTGCAAGCAACCCGTGCCTTGATGGTGGTTGGCATCCTCCTGGGAGTGAT 

AGCAATCTTTGTGGCCACCGTTGGCATGAAGTGTATGAAGTGCTTGGAAGACGATGAGGTGC 

AGAAGATGAGGATGGCTGTCATTGGGGGTGCGATATTTCTTCTTGCAGGTCTGGCTATTTTA 

GTTGCCACAGCATGGTATGGCAATAGAATCGTTCAAGAATTCTATGACCCTATGACCCCAGT 

CAATGCCAGGTACGAATTTGGTCAGGCTCTCTTCACTGGCTGGGCTGCTGCTTCTCTCTGCC 

TTCTGGGAGGTGCCCTACTTTGCTGTTCCTGTCCCCGAAAAACAACCTCTTACCCAACACCA 

AGGCCCTATCCAAAACCTGCACCTTCCAGCGGGAAAGACTACGTGTGACACAGAGGCAAAAG 

GAGAAAATCATGTTGAAACAAACCGAAAATGGACATTGAGATACTATCATTAACATTAGGAC 

CTTAGAATTTTGGGTATTGTAATCTGAAGTATGGTATTACUy^CAAACAAAaL?^C^^ 

ACCCATGTGTTAAAATACTCAGTGCTAAACATGGCTTAATCTTATTTTATCTTCTTTCCTCA 

ATATAGGAGGGAAGATTTTTCC^TTTGTATTACTGCTTCCCATTGAGTAATCATACTCAAAT 

GGGGGAAGGGGTGCTCCTTAAATATATATAGATATGTATATATACATGTTTTTCTATTAAAA 

ATAGACAGTAAAATACTATTCTCATTATGTTGATACTAGCATACTTAAAATATCTCTAAAAT 

AGGTAAATGTATTTAATTCCATATTGATGAAGATGTTTATTGGTATATTTTCTTTTTCGTCC 

TTATATACATATGTAACAGTCAAATATCATTTACTCTTCTTCATTAGCTTTGGGTGCCTTTG 

CCACAAGACCTAGCCTAATTTACCAAGGATGAATTCTTTCAATTCTTCATGCGTGCCCTTTT 

CATATACTTATTTTATTTTTTACCATAATCTTATAGCACTTGCATCGTTATTAAGCCCTTAT 

TTGTTTTGTGTTTCATTGGTCTCTATCTCCTGAATCTAACACATTTCATAGCCTACATTTTA 

GTTTCTAAAGCCAAGAAGAATTTATTACAAATCAGAACTTTGGAGGCAAATCTTTCTGCATG 

ACCAAAGT6ATAAATTCCTGTTGACCTTCCCACACAATCCCTGTACTCTGACCCATAGCACT 

CTTGTTTGCTTTGAAAATATTTGTCCAATTGAGTAGCTGCATGCTGTTCCCCCAGGTGTTGT 

AACACAACTTTATTGATTGAATTTTTAA6CTACTTATTCATAGTTTTATATCCCCCTAAACT 

ACCTTTTTGTTCCCCATTCCTTAATTGTATTGTTTTCCCAAGTGTAATTATCATGCGTTTTA 

TATCTTCCTAATAAGGTGTGGTCTGTTTGTCTGAACAAAGTGCTAGACTTTCTGGAGTGATA 

ATCTGGTGACAAATATTCTCTCTGTAGCTGTAAGCAAGTCACTTAATCTTTCTACCTCTTTT 

TTCTATCTGCCAAATTGAGATAATGATACTTAACCAGTTAGAAGAGGTAGTGTGAATATTAA 

TTAGTTTATATTACTCTTATTCTTTGAACATGAACTATGCCTATGTAGTGTCTTTATTTGCT 

CAGCTGGCTGAGACACTGAAGAAGTCACTGAACAAAACCTACACACGTACCTTCATGTGATT 

CACTGCCTTCCTCTCTCTACCAGTCTATTTCCACTGAACAAAACCTACACACATACCTTCAT 

GTGGTTCAGTGCCTTCCTCTCTCTACCAGTCTATTTCCACTGAACAAAACCTACGCACATAC 

CTTCATGTGGCTCAGTGCCTTCCTCTCTCTACCAGTCTATTTCCATTCTTTCAGCTGTGTCT 

GACATGTTTGTGCTCTGTTCCATTTTAACAACTGCTCTTACTTTTCCAGTCTGTACAGAATG 

CTATTTCACTTGAGCAAGATGATGTAATGGAAAGGGTGTTGGCACTGGTGTCTGGAGACCTG 

GATTTGAGTCTTGGTGCTATCAATCACCGTCTGTGTTTGAGCAAGGCATTTGGCTGCTGTAA 

GCTTATTGCTTCATCTGTAAGCGGTGGTTTGTAATTCCTGATCTTCCCACCTCACAGTGATG 

TTGTGGGGATCCAGTGAGATAGAATACATGTAAGTGTGGTTTTGTAATTTAAAAAGTGCTAT 

ACTAAGGGAAAGAATTGAGGAATTAACTGGATACGTTTTGGTGTTGCTTTTCAAATGTTTGA 

AAATAAAAAAAATGTTAAG 



FIGURE 98



></usr/seqdb2/sst/DNA/Dnaseqs.min/ss.DNA52185
><subunit 1 of 1, 211 aa, 1 stop
><MW: 22744, pI: 8.51, NX(S/T): 1

MANAGLQLLGFILAFLGWIGAIVSTALPQWRIYSYAGDNIVTAQAMYEGLWMSCVSQSTGQI 
QCKVFDSLLNLSSTLQATRALMWGILLGVIAIFVATVGMKCMKCLEDDEVQKMRMAVIGGA 
IFLLAGLAILVATAWYGNRIVQEFYDPMTPVNARYEFGQALFTGWAAASLCLLGQALLCCSC 
PRKTTSYPTPRPYPKPAPSSGKDYV 

Important features: 
Signal peptide: 

amino acids 1-21 

Transmembrane domains: 

amino acids 82-102, 118-142 and 161-187 

N-glycosylation site, 
amino acids 72-75 

PMP-22 / EMP / MP20 family proteins 

amino acids 70-111 

ABC-2 type transport system integral membrane protein 

amino acids 119-133 
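The correspondence between a nucleotide listing and its deduced polypeptide can be spot-checked by translating the open reading frame with the standard genetic code. The Python sketch below does this for the first thirty nucleotides of the coding region as printed in the preceding nucleotide listing (ATGGCC...TTGGGC). The function and table names are illustrative, and only the standard code is assumed.

```python
from itertools import product

# Standard genetic code, codons enumerated with bases in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(dna: str, stop_at_stop: bool = True) -> str:
    """Translate a DNA string in frame 1; unknown codons (e.g. with N) give 'X'."""
    dna = dna.upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE.get(dna[i:i + 3], "X")
        if aa == "*" and stop_at_stop:
            break
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    orf_start = "ATGGCCAACGCGGGGCTGCAGCTGTTGGGC"  # copied from the cDNA listing
    print(translate(orf_start))
```

Ambiguous or OCR-damaged bases translate to "X", which makes damaged stretches of a listing easy to spot when compared with the printed polypeptide.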



FIGURE 99 

TTCTGGCCAAACCCGGGGCTNCAGCTGTTGGGCTTCATCTCGCCTTCCTGGGATGGATCGGC 

GCCATCNTCacyiCTGCCCTTCCCCAGTGGAGQATTTTACTCCCTATGCTGGCGACAACATCG 

TGACCGCCCAGCCCATGTACGAGGGGCTGTGGATGTCCNGCGTGTCGCAGAGCACCGGGCAG 

ATCC^GTGOiAAGTCTTTGACTCCTTGCTGAATCTGAGCAGC^a^TTGCa^GC^ 

CTTGATG6TGGTTGGCATCCTCCTGGGAGTGATAGCAATCTTTGTGGCCACCGTTGGCATGA 

AGTGTATGAAGTGCTTGGAAGACGATGAGGTGCAGAAGATGAGGATGGCTGTCATTGGGGGC 

GCGATATTTCTTCTTGCAGGTCTGGCTATTTTAGTTGCCACAGCATGGTATGGCAATAGAAN 

CNTTCAACANTTCTATGACCCTATGACCCCAGTCAATGCCAGGTACGAATTTGGTCA 

GGCTCTCTTCACTGGCTGGGCTGCTGCTTCTCTCTGCCTTCTGGGAGGTGCCCTACTTTGCT 

GTTCCTGTCCC 



FIGURE 100 



ACCCTTGACCCAACGCGGCCCCCCGACCGNTTCATGGCCAAACGCGGGNCTCCAGCTGTTGG 
GCTTCATTCTCCCCTTCCTGGGATGGACCGGCGCCCATCNTCAGCACTGCCCTGCCCCAGTG 
GAGGATTTACTCCTATNCCGGCaJACAACATCGTGACCGCCCAGGCC3!TTGTACGAGGGGCTGT 
GGATGTCCTGCGTGTCGCAGAGCACCGGGCAGATCCAGTGCAAAGTCTTTGACTCCCTTGCT 
GAATCTGAGCAGGACATTGCAAGCAACCCGTGCCTTGATGGTGGTTGGCATCCTCCTGGGAG 
TGATAGCAATCTTNNTGGCCACCGTTGTNWNTGAAGTGTATGAAGTGCTTGGAAGACGATGA 
GGTGCAGAAGATGAGGATGGCTGTCATTGGGGGCGCGATATTTCTTCTTGCAGGTCTGGCTA 
TTTTAGTTGCCACAGCATGGTATGGCAATAGAATCGTTCAAGAATTCTATGACCCT^ 



FIGURE 101 



GGGCCCGACCATTATCCAACCGGGNTCACTGTTGGCTCATCTCCCTCCTGGATGAANCGCGC 
CATaJTCAGACTCCCTGCCCaiTGGAGATTTmCCTATGCTGGCGACAA.CATCNTGACCCCC 
AGCCATGTACGAGGGGCTTTGAACGTCNGCGTGTCGCAGANCACCGGGCAGATCCAGTGCAA 
AGTCTTTGACTCCTTGCTGAATCTGNGCAGCACATTGCAGCAACCC33TGCCCTGATGGTGGT 
TGGCATCCTCCTGGGAGTGATAGCAATCTTTGTGGCCACCGTTGGCATGAAGTGTATGAAGT 
GCTTGGAAGACGATGAGGTGCAGAAGATGAGGATGGCTGTCATTGGGGGCGCGATATTTCTT 
CTTGCAGGTCTGGCTATTTNMNGTTGCCACAGCATGGTATGGCAATAGAATCGTTCAAGAAT 
TCTATGACCCTATGACCCCAGTCAATGCCAGGTACGAATTTGGTCAGGCTCTCTTCACTGGC 
TGGGCTGCTGCTTCTCTCTGCCTTCTGGGAGGTGCCCTACTTTGCTGTTCCTGCGA 



FIGURE 102 



ATTCTCCCCTCCTGGATGGATCGCNCCACCGTCACATTGCCTTCCCCCANTGGAGGATTNAC 
TCCTATGCTGGCGACAACATCGTGACCCCCCAGGCCATTTACCGAGGGGCTTTGGATGTCNT 
GCaSTTGTCGCAGAGCACCGGGCAGATCCCAGTGCAAAGTCTTTGACTCCTTGCTGAATCTGAG 
C^GCACATTGC^GCAACCCGTGCCTTGATGGGGTTGGCATCCTCCTGGGAGTGATAGCAAC 
CTTTGTGGCCACCGTTGGCATGAAGTGTATGAAGTGCTTGGAAGACGATGAGGTGCCAGAAG 
ATGAGGATGGCTGTCATTGGGGGCGCGATATTTCTTGTTGCAGGTCTGGCTATTTTAGTNGC 
CACAGCATGGTATGGCAATAGAlSrrNNTTCNNGlSIl!lNTCTATGACCCT 

CCAGGTACGAATTTGGTCAGGCTCTCTTCACTGGCTGGGCTGCTGCTTCTCTCTGCCTTCTG 
GGAGGTGCCCTACTTTGCTGTTCCTGTCCC 



FIGURE 103



AGAGCACCGGCAGATCCCAGTNCAAAGTCTTTGACCCTTGCTGAATCTGAGCAGCACATmC 
AAGCAACCCCTTGCCTTGAAGGTGGTTGNCATCCCCCCTGGGAGTGAATAGCAATCTTTGTG 
GCCACCGTTGGCATGAAGTNTATGAAGTGCTTGGAAGACGATGAGGTGCAGAAGATGAGGAT 
GGCTGTCT^TTGGGGGCGCGATATTTCTTCTTGCAGGTCTGGCTATTTTAGTNNCCACAGCAT 
GGTATGGCAATAGNATNNTTCGNGGNTTCTATGACCCTATGACCCCAGTCAATGCCAGGTAC 
GAATTTGGTCAGGCTCTCTTCACTGGCTGGGCTGCTGCTTCTCTCTGCCTTCTGGGAGGTGC 
CCTACTTTGCTGTTCCTGTCCCCGAA 



FIGURE 104 



AGCAATGCCCTGCCCCa^GTGGAGGATTAATTCCTATGNTGGGGACAACA.TTGTGACNGCCC 

AGGCCATGTACGGGGGGCTGTGGATGTCCTGCGTGTCGCAGAGCACCGGGCAGATCCAGTGC 

AAAGTNTTTGACTCCrroCTGAATTTGAGCAGCaCATTGCAAGCAACCCGTGCCT 

GGTTGGCATCTTCCTGGGAGTGATAGCAATCTTTGTGGCCACCGTGGNAATGAAGTGTATGA 

AGTGCTTGGAAGACGATGAGGTGCAGAAGATGAGGATGGCTGTCATTGGGGGCGCGATATTT 

CTTNTTGCAGGTCTGGCTATTTTAGTTGC<^a^Ga^TGGTATGGCy^TAGAATNGTTCAAGA 

ATTTTATGACCCTATGACCCCAGTCAATGCCAGGTACGAATTTGGTCAGGCTTTNTTCACTG 

GCTGGGCTGCTGCTTNTTTCTGCCTTOTGGGAGGTGCCCTANTTTGCTGTTCCTGCGAACC 



FIGURE 105 



TCATAGGGGGGCGCGATATTTTTTCTTGCAGGTNTGGTTATTTTAGTTGCCACAGCATGGTA 
TGGCAATAGAATCGTTC2UIGAATTNTATGACCCTATGACCCCAGTCAATGCCAGGTACGAAT 
TTGGTCatGGCTCTNTTCa^CTGGITrGGGCTGCTGCTTCTNTNNGCCTTOT 
CTTTGCTGTTCCTG 



FIGURE 106 



TTCCTGGGATGGATCCGCCCCCaiTCNTCACATGCCCTGCCCCNTGGAGATTTACNCCTATGC 

TGGCGAACAACATCNTGACCGCCCAGGCCATGTACGAGGGGCTGTGGAATGTCCTGCGTGTC 

CCAGAGCACCGGGCAGATCCAGTGCAAAGTCTTTGACTCCTTGCTGAATCTGAGCAGCACAT 

TGCAAGCAACCNTGCCTTGATGGTGGTTGGCATCCTCCTGGGAGTGATAGCAATCTTTGTGG 

CCACCGTTGGCATGAAAGTGTATGAAGTGCTTGGAAGACGATGAGGTGCAGAAGATGAGGAT 

GGCTGTCATTGGGGGCGCGATATTTCTTCTTGCAGGTCTGGCTATTTTAGNNGCCAC^GC^ 

GGTATGGCAATCAGACCCNNTCANAAACTCTATGACCCTATGACCCCAGTCAATGCCAGGTA 

CGAATTTGGTCAGGCTCTCTTCACTGGCTGGGCTGCTGCTTCTCTCTGCCTTCTGGGAGGTG 

CCCTACTTTGCTGTTCCTGTCCCC6AAAAACAACCTCTTACCCACG 



FIGURE 107 



CGGGGCTGCAGCTGTTGGGCTTCATCTCGCTTCCTGGGATGGAATCGGCGCCATCGTCAGCA 
CTGCCCTGCCCC^TGGAGQATTTACTCaiTATGCTGGCGACAACATCGTGACCNCCCAGGCCA 
TGTACGAGGGGCTGTGGATGTCaiGCGTGTCGCAGAGCACCGGGCAGATCCAGTGCAAAGTCT 
TTGACTCCTTGCTGAATCTGAGCaLGC^CATTGCAAGC^CC3!lTGCCTTGATGGTGGTTGGC^ 
TCCTCCTGGGAGTGATAGCAATCTTTGTGGCCACCGTTGGCATGAAGTGTATGAAGTGCTTG 
GAAGACGATGAGGTGCAGAAGATGAGGATGGCTGTCATTGGGGGCGCGATATTTCTTCTTGC 
AGGTCTGGCTATTTNTAGTTGCCACAGCATGGTATGGCAATAGAATCGTTCAAGAATTCTAT 
GACCCTATGACCCCAGTCAATGCCAGGTACGAATTTGGTCAGGCTCTCTTCACTGGCTGGGC 
TGCTGCTTCTCTCTGCCTTCTGGGAGGTGCCCTACTTTGCTGTTCCTGCGAA 



FIGURE 108 



GCGTGCCGTCAGCTCGCCGGGCACCGCGGCCTCGCCCTCGCCCTCCGCCCCTGCGCCTGCAC 
CGCGTAGACCGACCCCCCCCTCCAGCGCGCCCACCCGGTAGAGGACCCCCGCCCGTGCCCCG 
ACCGGTCCCCGCCTTTTTGTAAAACTTAAAGCGGGCGCAGCATTAACGCTTCCCGCCCCGGT 
GACCTCTCAGGGGTCTCCCCGCCAAAGGTGCTCCGCCGCTAAGGAAC ATGG CGAAGGTGGAG 
CAGGTCCTGAGCCTCGAGCCGCAGCACGAGCTCAAATTCCGAGGTCCCTTCACCGATGTTGT 
CACCACCAACCTAAAGCTTGGCAACCCGACAGACCGAAATGTGTGTTTTAAGGTGAAGACTA 
CAGCACCACGTAGGTACTGTGTGAGGCCCAACAGCGGAATCATCGATGCAGGGGCCTCAATT 
AATGTATCTGTGATGTTACAGCCTTTCGATTATGATCCCAATGAGAAAAGTAAACACAAGTT 
TATGGTTCAGTCTATGTTTGCTCCAACTGACACTTCAGATATGGAAGCAGTATGGAAGGAGG 
CAAAACCGGAAGACCTTATGGATTCAAAACTTAGATGTGTGTTTGAATTGCCAGCAGAGAAT 
GATAAACCACATGATGTAGAAATAAATAAAATTATATCCACAACTGCATCAAAGACAGAAAC 
ACCAATAGTGTCTAAGTCTCTGAGTTCTTCTTTGGATGACACCGAAGTTAAGAAGGTTATGG 
AAGAATGTAAGAGGCTGCAAGGTGAAGTTCAGAGGCTACGGGAGGAGAACAAGCAGTTCAAG 
GAAGAAGATGGACTGCGGATGAGGAAGACAGTGCAGAGCAACAGCCCCATTTCAGCATTAGC 
CCCAACTGGGAAGGAAGAAGGCCTTAGCACCCGGCTCTTGGCTCTGGTGGTTTTGTTCTTTA 
TCGTTGQTGTAATTATTGGGAAGATTGCCTT GTAGA GGTAGCATGCACAGGATGGTAAATTG 
GATTGGT6GATCCACCATATCATGGGATTTAAATTTATCATAACCATGTGTAAAAAGAAATT 
AATGTATGATGACATCTCACAGGTCTTGCCTTTAAATTACCCCTCCCTGCACACACATACAC 
AGATACACACACACAAATATAATGTAACGATCTTTTAGAAAGTTAAAAATGTATAGTAACTG 
ATTGAGGGGGAAAAAGAATGATCTTTATTAATGACAAGGGAAACCATGAGTAATGCCACAAT 
GGCATATTGTAAATGTCATTTTAAACATTGGTAGGCCTTGGTACATGATGCTGGATTACCTC 
TCTTAAAATGACACCCTTCCTCGCCTGTTGGTGCTGGCCCTTGGGGAGCTGGAGCCCAGCAT 
GCTGGGGAGTGCGGTCAGCTCCACACAGTAGTCCCCACGTGGCCCACTCCCGGCCCAGGCTG 
CTTTCCGTGTCTTCAGTTCTGTCCAAGCCATCAGCTCCTTGGGACTGATGAACAGAGTCAGA 
AGCCCAAAGGAATTGCACTGTGGCAGCATCAGACGTACTCGTCATAAGTGAGAGGCGTGTGT 
TGACTGATTGACCCAGCGCTTTGGAAATAAATGGCAGTGCTTTGTTCACTTAAAGGGACCAA 
GCTAAATTTGTATTGGTTCATGTAGTGAAGTCAAACTGTTATTCAGAGATGTTTAATGCATA 
TTTAACTTATTTAAT6TATTTCATCTCATGTTTTCTTATTGTCACAAGAGTACAGTTAATGC 
TGCGTGCTGCTGAACTCTGTTGGGTGAACTGGTATTGCTGCTGGAGGGCTGTGGGCTCCTCT 
GTCTCTGGAGAGTCTGGTCATGTGGAGGTGGGGTTTATTGGGATGCTGGAGAAGAGCTGCCA 
GGAAGTGTTTTTTCTGGGTCAGTAAATAACAACTGTCATAGGGAGGGAAATTCTCAGTA6TG 
ACA6TCAACTCTAGGTTACCTTTTTTAATGAAGAGTAGTCAGTCTTCTAGATTGTTCTTATA 
CCACCTCTCAACCATTACTCACACTTCCAGCGCCCAGGTCCAAGTCTGAGCCTGACCTCCCC 
TTGGGGACCTAGCCTGGAGTCAGGACAAATGGATCGGGCTGCAGAGGGTTAGAAGCGAGGGC 
ACCAGCAGTTGTGGGTGGGGAGCAAGGGAAGAGAGAAACTCTTCAGCGAATCCTTCTAGTAC 
TAGTTGAGAGTTTGACTGTGAATTAATTTTATGCCATAAAAGACCa^CCCAGTTCTGTTTGA 
CTATGTAGCATCTTGAAAAGAAAAATTATAATAAAGCCCCAAAATTAAGAAAA 



FIGURE 109



></usr/seqdb2/sst/DNA/Dnaseqs.min/ss.DNA53977
><subunit 1 of 1, 243 aa, 1 stop
><MW: 27228, pI: 7.43, NX(S/T): 2

MAKVEQVLSLEPQHELKFRGPFTDVVTTNLKLGNPTDRNVCFKVKTTAPRRYCVRPNSGIID
AGASINVSVMLQPFDYDPNEKSKHKFMVQSMFAPTDTSDMEAVWKEAKPEDLMDSKLRCVFE
LPAENDKPHDVEINKIISTTASKTETPIVSKSLSSSLDDTEVKKVMEECKRLQGEVQRLREE
NKQFKEEDGLRMRKTVQSNSPISALAPTGKEEGLSTRLLALVVLFFIVGVIIGKIAL

Important features:
Transmembrane domain: 
amino acids 224-239 

N-glycosylation site. 

amino acids 68-71 

N-myristoylation site. 

amino acids 59-64, 64-69 and 235-240 



FIGURE 110 



GTCAGTCTTCTAGATTGTCCTTATCCCACCTTTCAACCANTACTCACATTTCNAGCGCCCAG
GTCCANGTCTGAGCCTGACTTCCCCTTGGGGACCTAGCCTGGAGTCAGGACAATGGNTCGGG 
CTGCAGAGGNTTAGAAGCGAGGGCACCAGCAGTTTTGGGTGGGGAGCAAGGGNNGAGAGAAA 
CTCTTCAGCGAATCCTTCTAGTACTAGTTGAGAGTTTGACTGTGAATTAATTTTATGCCATA 
AAAGAOTAACCCAGTTCTGTTTGACTATGTAGCATCTTGAAAAGAAAAATTATAATAAAGCC 
CCAAAATTAAGAATTCTTTTGTCATTTTGTCACATTTGCTCTATGGGGGGAATTATTATTTT 
ATCATTTTTATTATTTTGCCATTGGAAGGTTl^CTTTAAAATGAGC 



FIGURE 111 

TATTGTAAAGGCCATTTTAAACCATTGGTAGGCCTTGGTACATGATGCTGGATTACCTCCTT 
AAATGACACCNTTCCTCGCCTGTTGGTGCTGGCCNTTGGGGAGCTGGAGCCCCAGCATGCTG 
GGGAGTGCGGTCAGCTCCACACAGTAGTCCCCACGTGGCCCACTCCCGGCCCAGGCTGCTTT 
CCGTGTCTTCAGTTCTGTCCAAGCCATCAGCTCCTTGGGACTGATGAACAGAGTCAGAAGCC 
CAAAGGAATTGCCACTGTGGCAGCATCAGACGTACTCGTCATAAGTGAGAGGCGTGTGTTGA
CTGATTGACCCAGCGCTTTGGAAATAAATGGCAGTGCTTTGTTCACTTAAAGGGACCAAGCT
AAATTGTATTGGTTCATGTAGTGAAGTCAAACTGTTATTCAGAGATGTTTAATGCATATTTA 
ACTTATTTAATGTATTTCATCTCATGTTTTCTTATTGTCACAAGAGTACAGTTAATGCTGCG 
TGCTGCTGAACTCTGTTGGGTGAACTGGTATTGCTGCTGGAGGGCTG 



FIGURE 112 



CCCTGGTGGTTTTGTTCTTTAATTCGTTGGTGTAATTNTTGGGAAGATTGCTTGTAGAGGTA 

GNATGCACCNGGCTGGTAAATTGGATTGGTGGATCCACCATATCCATGGGATTTAAATTTAT 

CTlTAACCATGTGTAAAAAGAZiATTAATGTATGATGACATNTCACAGGTATTGCCTTTAZ^TT 

ACCCATCCCTGNANACACATACACAGATACACaNANACAAATNTAATGTAACGATNTTTT^ 

AAAGTTAAAAATGTATAGTAAC 



FIGURE 113

GGTGGCCCATTCCCGGCCCAGGCTGCTTTCCGGTNTTCAGTTCTGTCCAAGCCATCAGCTCC 
TTGGGACTGATGAACAGAGTCAGAAGCCCAAAGGAATTGCACTGTGGCAGCATNAGACGTAC
TTGTNATAAGTGAGAGGCGTGTGTTGACTGATTGACCCAGCGCTTTGGAAATAAATGGCAGT 
GCTTTGTTCANTTAAAGGGACCAAGCTAAATTTGTATTGGTTCATGTAGTGAAGTCAAACTG 
TTATTCAGAGATGTTTAATGCATATTTAANTTATTTAATGTATTTNATNTCATGTTTTCTTA
TTGTCACAAGAGTACAGTTAATGCTGCGTGCTGCTGAANTNTGTTGGGTGAACTGGTATTGC
TGCTGGAGGGCTGTGGGCTCCTCTGTCTTTGGAGAGTCTGGTCATGTGGAGGTGGG 



FIGURE 114 



TGCTTTCCGTGTCTTCAGTTCTGTCCAAGCCATCAGCTCCTTGGGACTTGATGAACAGAGTC 

AGAAGCCCAAAGGAATTGCACTGTGGCAGCATCAGACGTACTCGTCATAAGTGAGAGGCGTG

TGTTGACTGATTGACCCAGCGCTTTGGAAATAAATGGCAGTGCTTTGTTCACTTAAAGGGAC 

CAAGCTAAATTTGTATTGGTTCATGTAGTGAAGTCAAACTGTTATTCAGAGATGTTTAATGC 

ATATTTAACTTATTTAATGTATTTCATCTCATGTTTTCTTATTGTCACAAGAGTACAGTTAA

TGCTGCGTGC 



FIGURE 115 

AAACCTTTAAAAGTTGAGGGGAAAAGAATGATCCTTTATTAATGACAAGGGAAACCNTGAGT

AATGCGACAATGGCATATTGTAAATGTCATTTTAAACATTGGTAGGCCTTGGTACATGATGC

TGGATTACCTCTCTTAAAATGACACCCTTCCTCGCCTGTTGGTGCTGGCCCTTGGGGAGCTN 

GAGCCCAGCATGCTGGGGAGTGCGGTCTGCTCCACACAGTAGTCCCCANGTGGCCCANTCCC 

GGCCCAGGCTGCTTTCCGTGTCTTCAGTTCTGTCCAAGCCATCAGCTCCTTGGGANTGATGA 

ACAGAGTCAGAAGCCCAAAGGAATTGCACTGTGGCAGCATCAGANGTANTNGTCATAAGTGA

GAGGCGTGTGTTGANTGATTGACCCAGCGCTTTGGAAATAAATGGCAGTGCTTTGTTCANTT 

AAAGGGNCCAAGNTAAATTTGTATTGGTTCATGTAGTGAAGTCAAANTGTTATTCAGAGATG

TTTAATGCATATTTAANTTATTTAATGTATTTCATNTCATGTTTTCTTATTGTCACAAGGGT 

ACAGTTAATGCTGCGTGCTGCTGAANTCTGTTGGGTGAANTGGTATTGCTG 



FIGURE 116 



GGCCCTTGGGGAGCTGGAGCCCAGCATGCTGGGGAGTGCGGTCAGCTCCACACAGTAGTCCC 
CACGTGGCCCACTCCCGGCCCAGGCTGCTTTCCGTGTCTTCAGTTCTGTCCAAGCCATCAGC 
TCCTTGGGACTGATGAACAGAGTCAGAAGCCCAAAGGAATTGCACTGTGGCAGCATCAGACG 
TACTCGTCATAAGTGAGAGGCGTGTGTTGACTGATTGACCCAGCGCTTTGGAAATAAATGGC 
AGTGCTTTGTTCACTTAAAGGGACCAAGCTAAATTTGTATTGGTTCATGTAGTGAAGTCAAA 
CTGTTATTCAGAGATGTTTAATGCATATTTAACTTATTTAATGTATTTCATCTCATGTTTTC 
TTATTGTCACAAGAGTACAGTTAATGCTGCGTGCTGCTGAACTCTGTTGGGTGAACTGGTAT 
TGCTGCTGGAGGGCTGTGGGCTCCTCTGTCTCTGGAGAGTCTGGTCATGTGGAGGTGGG 



FIGURE 117 



GCGAGCTCCGGGTGCTGTGGCCCGGCCTTGGCGGGGCGGCCTCCGGCTCAGGCTGGCTGAGA 
GGCTCCCAGCTGCAGCGTCCCCGCCCGCCTCCTCGGGAGCTCTGATCTCAGCTGACAGTGCC 
CTCGGGGACCAAACAAGCCTGGCAGGGTCTCACTTTGTTGCCCAGGCTGGAGTTCAGTGCCA 
TGATCATGGTTTACTGCAGCCTTGACCTCCTGGGTTCAAGCGATCCTGCTGAGTAGCTGGGA
CTACAGGACAAAATTAGAAGATCAAAATGGAAAATATGCTGCTTTGGTTGATATTTTTCACC 
CCTGGGTGGACCCTCATTGATGGATCTGAAATGGAATGGGATTTTATGTGGCACTTGAGAAA 
GGTACCCCGGATTGTCAGTGAAAGGACTTTCCATCTCACCAGCCCCGCATTTGAGGCAGATG 
CTAAGATGATGGTAAATACAGTGTGTGGCATCGAATGCCAGAAAGAACTCCCAACTCCCAGC 
CTTTCTGAATTGGAGGATTATCTTTCCTATGAGACTGTCTTTGAGAATGGCACCCGAACCTT 
AACCAGGGTGAAAGTTCAAGATTTGGTTCTTGAGCCGACTCAAAATATCACCACAAAGGGAG 
TATCTGTTAGGAGAAAGAGACAGGTGTATGGCACCGACAGCAGGTTCAGCATCTTGGACAAA 
AGGTTCTTAACCAATTTCCCTTTCAGCACAGCTGTGAAGCTTTCCACGGGCTGTAGTGGCAT 
TCTCATTTCCCCTCAGCATGTTCTAACTGCTGCCCACTGTGTTCATGATGGAAAGGACTATG 
TCAAAGGGAGTAAAAAGCTAAGGGTAGGGTTGTTGAAGATGAGGAATAAAAGTGGAGGCAAG
AAACGTCGAGGTTCTAAGAGGAGCAGGAGAGAAGCTAGTGGTGGTGACCAAAGAGAGGGTAC
CAGAGAGCATCTGCAGGAGAGAGCGAAGGGTGGGAGAAGAAGAAAAAAATCTGGCCGGGGTC
AGAGGATTGCCGAAGGGAGGCCTTCCTTTCAGTGGACCCGGGTCAAGAATACCCACATTCCG 
AAGGGCTGGGCACGAGGAGGCATGGGGGACGCTACCTTGGACTATGACTATGCTCTTCTGGA 
GCTGAAGCGTGCTCACAAAAAGAAATACATGGAACTTGGAATCAGCCCAACGATCAAGAAAA 
TGCCTGGTGGAATGATCCACTTCTCAGGATTTGATAACGATAGGGCTGATCAGTTGGTCTAT 
CGGTTTTGCAGTGTGTCCGACGAATCCAATGATCTCCTTTACCAATACTGCGATGCTGAGTC 
GGGCTCCACCGGTTCGGGGGTCTATCTGCGTCTGAAAGATCCAGACAAAAAGAATTGGAAGC 
GCAAAATCATTGCGGTCTACTCAGGGCACCAGTGGGTGGATGTCCACGGGGTTCAGAAGGAC 
TACAACGTTGCTGTTCGCATCACTCCCCTAAAATACGCCCAGATTTGCCTCTGGATTCACGG 
GAACGATGCCAATTGTGCTTACGGCTAACAGAGACCTGAAACAGGGCGGTGTATCATCTAAA 
TCACAGAGAAAACCAGCTCTGCTTACCGTAGTGAGATCACTTCATAGGTTATGCCTGGACTT 
GAACTCTGTCAATAGCATTTCAACATTTTTCAAAATC^GGAGATTTTCGTCCATTTAAAAAA 
TGTATAGGTGCAGATATTGAAACTAGGTGGGO^CTTCAATGCCAAGTATATACTCTTCTTTA 
CATGGTGATGAGTTTCATTTGTAGAAAAATTTTGTTGCCTTCTTAAAAATTAGACACACTTT 
AAACCTTCAAACAGGTATTATAAATAACATGTGACTCCTTAATGGACTTATTCTCAGGGTCC 
TACTCTAAGAAGAATCTAATAGGATGCTGGTTGTGTATTAAATGTGAAATTGCATAGATAAA 
GGTAGATGGTAAAGCAATTAGTATCAGAATAGAGACAGAAAGTTACAACACAGTTTGTACTA 
CTCTGAGATGGATCCATTCAGCTCATGCCCTCAATGTTTATATTGTGTTATCTGTTGGGTCT 
GGGACATTTAGTTTAGTTTTTTTGAAGAATTACAAATCAGAAGAAAAAGCAAGCATTATAAA 
CAAAACTAATAACTGTTTTACTGCTTTAAGAAATAACAATTACAATGTGTATTATTTAAAAA 
TGGGAGAAATAGTTTGTTCTATGAAATAAACCTAGTTTAGAAATAGGGAAGCTGAGACATTT 
TAAGATCTCAAGTTTTTATTTAACTAATACTCAAAATATGGACTTTTCATGTATGCATAGGG 
AAGACa^CTTCACAAATTATGAATGATCATGTGTTGAAAGCCACATTATTTTATGCTATACAT 
TCTATGTATGAGGTGCTACATTTTTAGGACAAAGAATTCTGTAATCTTTTTCAAGAAAGAGT 
CTTTTTCTCCTTGACAAAATCCAGCTTTTGTATGAGGACTATAGGGTGAATTCTCTGATTAG 
TAATTTTAGATATGTCCTTTCCTAAAAATGAATAAAATTTATGAATATGA 



FIGURE 118 



</usr/seqdb2/sst/DNA/Dnaseqs.min/ss.DNA57253
<subunit 1 of 1, 413 aa, 1 stop
<MW: 47070, pI: 9.92, NX(S/T): 3

MENMLLWLIFFTPGWTLIDGSEMEWDFMWHLRKVPRIVSERTFHLTSPAFEADAKMMVNTVC
GIECQKELPTPSLSELEDYLSYETVFENGTRTLTRVKVQDLVLEPTQNITTKGVSVRRKRQV 
YGTDSRFSILDKRFLTNFPFSTAVKLSTGCSGILISPQHVLTAAHCVHDGKDYVKGSKKLRV 
GLLKMRNKSGGKKRRGSKRSRREASGGDQREGTREHLQERAKGGRRRKKSGRGQRIAEGRPS 
FQWTRVKNTHIPKGWARGGMGDATLDYDYALLELKRAHKKKYMELGISPTIKKMPGGMIHFS
GFDNDRADQLVYRFCSVSDESNDLLYQYCDAESGSTGSGVYLRLKDPDKKNWKRKIIAVYSG 
HQWVDVHGVQKDYNVAVRITPLKYAQICLWIHGNDANCAYG 

Important features:
Signal peptide: 

amino acids 1-16 

N-glycosylation sites. 

amino acids 90-93, 110-113 and 193-196 

Glycosaminoglycan attachment site.

amino acids 236-239 

Serine proteases, trypsin family, histidine active site, 
amino acids 165-170 



FIGURE 119 



AATGTGAGAGGGGCTGATGGAAGCTGATAGGCAGGACTGGAGTGTTAGCACCAGTACTGGAT 

GTGACAGC^GGCAGAGGAGCACTTAGCAGCTTATTCAGTGTCCGATTCTGATTCCGGCAAGG 

ATCCAAGCATGGAATGCTGCCGTCGGGCAACTCCTGGCACACTGCTCCTCTTTCTGGCTTTC

CTGCTCCTGAGTTCCAGGACCGCACGCTCCGAGGAGGACCGGGACGGCCTATGGGATGCCTG 

GGGCCCATGGAGTGAATGCTCACGCACCTGCGGGGGAGGGGCCTCCTACTCTCTGAGGCGCT 

GCCTGAGCAGCAAGAGCTGTGAAGGAAGAAATATCCGATACAGAACATGCAGTAATGTGGAC 

TGCCCACCAGAAGCAGGTGATTTCCGAGCTCAGCAATGCTCAGCTCATAATGATGTCAAGCA 

CCATGGCCAGTTTTATGAATGGCTTCCTGTGTCTAATGACCCTGACAACCCATGTTCACTCA 

AGTGCCAAGCCAAAGGAACAACCCTGGTTGTTGAACTAGCACCTAAGGTCTTAGATGGTACG 

CGTTGCTATACAGAATCTTTGGATATGTGCATCAGTGGTTTATGCCAAATTGTTGGCTGCGA 

TCACCAGCTGGGAAGCACCGTCAAGGAAGATAACTGTGGGGTCTGCAACGGAGATGGGTCCA 

CCTGCCGGCTGGTCCGAGGGCAGTATAAATCCCAGCTCTCCGCAACCAAATCGGATGATACT

GTGGTTGCACTTCCCTATGGAAGTAGACATATTCGCCTTGTCTTAAAAGGTCCTGATCACTT

ATATCTGGAAACCAAAACCCTCCAGGGGACTAAAGGTGAAAACAGTCTCAGCTCCACAGGAA

CTTTCCTTGTGGACAATTCTAGTGTGGACTTCCAGAAATTTCCAGACAAAGAGATACTGAGA 

ATGGCTGGACCACTCACAGCAGATTTCATTGTCAAGATTCGTAACTCGGGCTCCGCTGACAG 

TACAGTCCAGTTCATCTTCTATCAACCCATCATCCACCGATGGAGGGAGACGGATTTCTTTC 

CTTGCTCAGCAACCTGTGGAGGAGGTTATCAGCTGACATCGGCTGAGTGCTACGATCTGAGG 

AGCAACCGTGTGGTTGCTGACCAATACTGTCACTATTACCCAGAGAACATCAAACCCAAACC 

CAAGCTTCAGGAGTGCAACTTGGATCCTTGTCCAGCCAGTGACGGATACAAGCAGATCATGC 

CTTATGACCTCTACCATCCCCTTCCTCGGTGGGAGGCCACCCCATGGACCGCGTGCTCCTCC 

TCGTGTGGGGGGGGCATCCAGAGCCGGGCAGTTTCCTGTGTGGAGGAGGACATCCAGGGGCA 

TGTCACTTCAGTGGAAGAGTGGAAATGCATGTACACCCCTAAGATGCCCATCGCGCAGCCCT 

GCAACATTTTTGACTGCCCTAAATGGCTGGCACAGGAGTGGTCTCCGTGCACAGTGACATGT

GGCCAGGGCCTCAGATACCGTGTGGTCCTCTGCATCGACCATCGAGGAATGCACACAGGAGG 

CTGTAGCCCAAAAACAAAGCCCCACATAAAAGAGGAATGCATCGTACCCACTCCCTGCTATA 

AACCCAAAGAGAAACTTCCAGTCGAGGCCAAGTTGCCATGGTTCAAACAAGCTCAAGAGCTA 

GAAGAAGGAGCTGCTGTGTCAGAGGAGCCCTCGTAAGTTGTAAAAGCACAGACTGTTCTATA 

TTTGAAACTGTTTTGTTTAAAGAAAGCAGTGTCTO^CTGGTTGTAGCTTTCATGGGTTCTGA 

ACTAAGTGTAATCa^TCTCACCAAAGCTTTTTGGCTCTCAAATTAAAGATTGATTAGTTTCAA 

AAAAAAAAA 



FIGURE 120 



</usr/seqdb2/sst/DNA/Dnaseqs.min/ss.DNA58847
<subunit 1 of 1, 525 aa, 1 stop
<MW: 58416, pI: 6.62, NX(S/T): 1

MECCRRATPGTLLLFLAFLLLSSRTARSEEDRDGLWDAWGPWSECSRTCGGGASYSLRRCLS 
SKSCEGRNIRYRTCSNVDCPPEAGDFRAQQCSAHNDVKHHGQFYEWLPVSNDPDNPCSLKCQ
AKGTTLVVELAPKVLDGTRCYTESLDMCISGLCQIVGCDHQLGSTVKEDNCGVCNGDGSTCR
LVRGQYKSQLSATKSDDTVVALPYGSRHIRLVLKGPDHLYLETKTLQGTKGENSLSSTGTFL
VDNSSVDFQKFPDKEILRMAGPLTADFIVKIRNSGSADSTVQFIFYQPIIHRWRETDFFPCS
ATCGGGYQLTSAECYDLRSNRVVADQYCHYYPENIKPKPKLQECNLDPCPASDGYKQIMPYD
LYHPLPRWEATPWTACSSSCGGGIQSRAVSCVEEDIQGHVTSVEEWKCMYTPKMPIAQPCNI
FDCPKWLAQEWSPCTVTCGQGLRYRVVLCIDHRGMHTGGCSPKTKPHIKEECIVPTPCYKPK
EKLPVEAKLPWFKQAQELEEGAAVSEEPS 

Important features: 
Signal peptide: 

amino acids 1-25 

N-glycosylation site, 
amino acids 251-254 

Thrombospondin 1 

amino acids 385-399 

von Willebrand factor type C domain proteins 

amino acids 385-399, 445-459 and 42-56 



FIGURE 121 

CGGACGCGTGGGCGGCGGCTGCGGAACTCCCGTGGAGGGGCCGGTGGGCCCTCGGGCCTGAC 

AGATGGCAGTGGCCACTGCGGCGGCAGTACTGGCCGCTCTGGGCGGGGCGCTGTGGCTGGCG

GCCCGCCGGTTCGTGGGGCCCAGGGTCCAGCGGCTGCGCAGAGGCGGGGACCCCGGCCTCAT 

GCACGGGAAGACTGTGCTGATCACCGGGGCGAACAGCGGCCTGGGCCGCGCCACGGCCGCCG 

AGCTACTGCGCCTGGGAGCGCGGGTGATCATGGGCTGCCGGGACCGCGCGCGCGCCGAGGAG 

GCGGCGGGTCAGCTCCGCCGCGAGCTCCGCCAGGCCGCGGAGTGCGGCCCAGAGCCTGGCGT 

CAGCGGGGTGGGCGAGCTCATAGTCCGGGAGCTGGACCTCGCCTCGCTGCGCTCGGTGCGCG 

CCTTCTGCCAGGAAATGCTCCAGGAAGAGCCTAGGCTGGATGTCTTGATCAATAACGCAGGG 

ATCTTCCAGTGCCCTTACATGAAGACTGAAGATGGGTTTGAGATGCAGTTCGGAGTGAACCA 

TCTGGGGCACTTTCTACTCACCAATCTTCTCCTTGGACTCCTCAAAAGTTCAGCTCCCAGCA 

GGATTGTGGTAGTTTCTTCCAAACTTTATAAATACGGAGACATCAATTTTGATGACTTGAAC 

AGTGAACAAAGCTATAATAAAAGCTTTTGTTATAGCCGGAGCAAACTGGCTAACATTCTTTT

TACCAGGGAACTAGCCaSCCGCTTAGAAGGCACAAATGTCACCGTCAATGTGTTGCATCCTG 

GTATTGTACGGACAAATCTGGGGAGGOICATACACATTCCACTGTTGGTGAJ^C 

AATTTGGTGTCATGGGCTTTTTTCAAAACTCCAGTAGAAGGTGCCCT^GACTTC^ 

GGCCTCTTCACCTGAGGTAGAAGGAGTGTCAGGAAGATACTTTGGGGATTGTAAAGAGGAAG 

AACTGTTGCCCAAAGCTATGGATGAATCTGTTGCAAGAAAACTCTGGGATATCAGTGAAGTG 

ATGGTTGGCCTGCTAAAATAGGAACAAGGAGTAAAAGAGCTGTTTATAAAACTGCATATCAG

TTATATCTGTGATCAGGAATGGTGTGGATTGAGAACTTGTTACTTGAAGAAAAAGAATTTTG 

ATATTGGAATAGCCTGCTAAGAGGTACATGTGGGTATTTTGGAGTTACTGAAAAATTATTTT 

TGGGATAAGAGAATTTCAGCAAAGATGTTTTAAATATATATAGTAAGTATAATGAATAATAA 

GTACAATGAAAAATACAATTATATTGTAAAATTATAACTGGGCAAGCATGGATGACATATTA 

ATATTTGTCAGAATTAAGTGACTCAAAGTGCTATCGAGAGGTTTTTCAAGTATCTTTGAGTT 

TCATGGCCAAAGTGTTAACTAGTTTTACTACAATGTTTGGTGTTTGTGTGGAAATTATCTGC

CTGGTGTGTGCACACAAGTCTTACTTGGAATAAATTTACTGGTAC 



FIGURE 122



</usr/seqdb2/sst/DNA/Dnaseqs.min/ss.DNA58747
<subunit 1 of 1, 336 aa, 1 stop
<MW: 36865, pI: 9.15, NX(S/T): 2

MAVATAAAVLAALGGALWLAARRFVGPRVQRLRRGGDPGLMHGKTVLITGANSGLGRATAAE
LLRLGARVIMGCRDRARAEEAAGQLRRELRQAAECGPEPGVSGVGELIVRELDLASLRSVRA
FCQEMLQEEPRLDVLINNAGIFQCPYMKTEDGFEMQFGVNHLGHFLLTNLLLGLLKSSAPSR
IVVVSSKLYKYGDINFDDLNSEQSYNKSFCYSRSKLANILFTRELARRLEGTNVTVNVLHPG
IVRTNLGRHIHIPLLVKPLFNLVSWAFFKTPVEGAQTSIYLASSPEVEGVSGRYFGDCKEEE
LLPKAMDESVARKLWDISEVMVGLLK

Important features: 
Signal peptide: 

amino acids 1-21 

Short-chain alcohol dehydrogenase family protein 

amino acids 134-144, 44-56 and 239-248 

N-glycosylation site. 

amino acids 212-215 and 239-242 



FIGURE 123 

GGGGATTGTAAAGAGGAAGNACTGTGCCCAAAGNTATGGATGAATCTGTTGCAAGAAAATTN 
TGGGATATCAGTGAAGTGATGGTTNGCCTGCTAAAATAGGAACAAGGAGTAAAAGAGCTGTT 
TATAAAACTGCATATCAGTTATATCTGTGATCAGGAATGGTGTGGATTGAGAACTTGTTACT 
TGAAGAAAAAGAATTTTGATATTGGAATAGCCTGNTAAGAGGNACATGTGGGTATTTTGGAG
TTACTGAAAAATTATTTTTGGGATAAGAGAATTTCAGCAAAGATGTTTTAAATATATATAGT 
AAGTATAATGAATAATAAGTACAATGAAAAATACAATTATATTGTAAAATTATAACTGGGCA 
AGCATGGATGACATATTAATATTTGTCAGAATTAAGTGACTCAAAGTGCTATCGAGAGGTTT
TTCAAGTATCTTTGAGTTTCATGGCCAAAGTGTTAACTAGTTTTACTACAATGTTTGGTGTT 
TGTGTGGAAATTATCTGCCTGGCTT 



FIGURE 124 



GAGAGGACGAGGTGCCGCTGCCTGGAGAATCCTCCGCTGCCGTCGGCTCCCGGAGCCCAGCC
CTTTCCTAACCCAACCCAACCTAGCCCAGTCCCAGCCGCCAGCGCCTGTCCCTGTCACGGAC 
CCCAGCGTTACCATGCATCCTGCCGTCTTCCTATCCTTACCCGACCTCAGATGCTCCCTTCT
GCTCCTGGTAACTTGGGTTTTTACTCCTGTAACAACTGAAATAACAAGTCTTGCTACAGAGA 
ATATAGATGAAATTTTAAACAATGCTGATGTTGCTTTAGTAAATTTTTATGCTGACTGGTGT 
CGTTTCAGTCAGATGTTGCATCCAATTTTTGAGGAAGCTTCCGATGTCATTAAGGAAGAATT 
TCCAAATGAAAATCAAGTAGTGTTTGCCAGAGTTGATTGTGATCAGCACTCTGACATAGCCC 
AGAGATACAGGATAAGCAAATACCCAACCCTCAAATTGTTTCGTAATGGGATGATGATGAAG 
AGAGAATACAGGGGTCAGCGATCAGTGAAAGCATTGGCAGATTACATCAGGCAACAAAAAAG 
TGACCCCATTCAAGAAATTCGGGACTTAGCAGAAATCACCACTCTTGATCGCAGCAAAAGAA
ATATCATTGGATATTTTGAGCAAAAGGACTCGGACAACTATAGAGTTTTTGAACGAGTAGCG
AATATTTTGCATGATGACTGTGCCTTTCTTTCTGCATTTGGGGATGTTTCAAAACCGGAAAG 
ATATAGTGGCGACAACATAATCTACAAACCACCAGGGCATTCTGCTCCGGATATGGTGTACT 
TGGGAGCTATGACAAATTTTGATGTGACTTACAATTGGATTCAAGATAAATGTGTTCCTCTT 
GTCCGAGAAATAACATTTGAAAATGGAGAGGAATTGACAGAAGAAGGACTGCCTTTTCTCAT 
ACTCTTTCACATGAAAGAAGATACAGAAAGTTTAGAAATATTCCAGAATGAAGTAGCTCGGC 
AATTAATAAGTGAAAAAGGTACAATAAACTTTTTACATGCCGATTGTGACAAATTTAGACAT 
CCTCTTCTGCACATACAGAAAACTCCAGCAGATTGTCCTGTAATCGCTATTGACAGCTTTAG 
GCATATGTATGTGTTTGGAGACTTCAAAGATGTATTAATTCCTGGAAAACTCAAGCAATTCG 
TATTTGACTTACATTCTGGAAAACTGCACAGAGAATTCCATCATGGACCTGACCCAACTGAT 
ACAGCCCCAGGAGAGCAAGCCCAAGATGTAGCAAGCAGTCCACCTGAGAGCTCCTTCCAGAA 
ACTAGCACCCAGTGAATATAGGTATACTCTATTGAGGGATCGAGATGAGCTTTAAAAACTTG 
AAAAACAGTTTGTAAGCCTTTC^CAGCAGCATCAACCTACGTGGTGGAAATAGTAAACCTA 
TATTTTOVrAATTCTATGTGTATTTTTATTTTGAATAAACAGAAAGAAATTTAAAAAAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 



FIGURE 125 

</usr/seqdb2/sst/DNA/Dnaseqs.min/ss.DNA57689
<subunit 1 of 1, 406 aa, 1 stop
<MW: 46927, pI: 5.21, NX(S/T): 0

MHPAVFLSLPDLRCSLLLLVTWVFTPVTTEITSLATENIDEILNNADVALVNFYADWCRFSQ
MLHPIFEEASDVIKEEFPNENQVVFARVDCDQHSDIAQRYRISKYPTLKLFRNGMMMKREYR
GQRSVKALADYIRQQKSDPIQEIRDLAEITTLDRSKRNIIGYFEQKDSDNYRVFERVANILH
DDCAFLSAFGDVSKPERYSGDNIIYKPPGHSAPDMVYLGAMTNFDVTYNWIQDKCVPLVREI
TFENGEELTEEGLPFLILFHMKEDTESLEIFQNEVARQLISEKGTINFLHADCDKFRHPLLH
IQKTPADCPVIAIDSFRHMYVFGDFKDVLIPGKLKQFVFDLHSGKLHREFHHGPDPTDTAPG
EQAQDVASSPPESSFQKLAPSEYRYTLLRDRDEL 

Important features:
Signal peptide: 

amino acids 1-29 

Endoplasmic reticulum targeting sequence, 
amino acids 403-406 

Tyrosine kinase phosphorylation site. 

amino acids 203-211 

Thioredoxin family proteins 
amino acids 50-66 



FIGURE 126 



ATTAAGGAAGAATTTCCAAATGAAAATCAAGTAGTNTTTGCCAGAGTNGATTGTGATCAGCA 

CTCTGACATAGCCCAGAGATACAGGATAAGCAAATACCCAACCCTCAAATTGTTTCGTAATG

GGATGATGATGAAGAGAGAATACAGGGGTCAGCGATCAGTGAAAGCATTGGCAGATTA



FIGURE 127 



AGAGGCCTCTCTGGZ^GTTGTCCCGGGTGTTCGCCGCNGGAGCCCGGGTCGAGAGGACasrAGG 
TGCCGCTGCCTGGAGAATCCTCCGCTGCCGTCGGCTCCCGGAGCCCAGCCCTTTCCTAACCC 
AACCCRACCTAGCCCNGTCCCAGCCGCCAGCGCCTGTCCCTGTCNCGGANCCCAGCGTNACC
ATGCATCCTGCCGTCTTCCTATCCTTACCCGACCTCAGATGCTCCCTTCTGCTCCTGGTAAC 
TTGGGTTTTTACTCCTGTAACAACTGAAATAAOJNGTCTTGATAaJNAGAATATAGATGAAA 
TTTTAAAOiJATGCTGATGTGGCTTTAGTCAATTTTTATGCTGACTGGTGTCGTTTCAGTCAG 
ATGTGGCATCCAATTTTTGAGGANGCTTCCGATGTCATTAAGGAAGAATTTCCAAATGAAAA
TCAAGTAGTGTTTGCCAGAGTTGATTGTGATCAGCACTCTGACATAGCCCAGAGATACAGGA 
TAAGCAAATACCCAACCCTCAAATTGTTTCGTAATGGGATGATGATGAAGAGAGAATACAGG 
GGTCAGCGATCAGTGAAAGCATTGGCAGATTACATCAGGC 



FIGURE 128 



GCCCACGCGTCCGATCGCGTTCACGTTCGCGGCCTTCTGCTACATGCTGGCGCTGCTGCTCA 
CTGCCGCGCTCATCTTCTTCGCCATTTGGCACATTATAGCATTTGATGAGCTGAAGACTGAT
TACAAGAATCCTATAGACCAGTGTAATACCCTGAATCCCCTTGTACTCCCAGAGTACCTCAT 
CCACGCTTTCTTCTGTGTCATGTTTCTTTGTGCAGCAGAGTGGCTTACACTGGGTCTCAATA 
TGCCCCTCTTGGCATATCATATTTGGAGGTATATGAGTAGACCAGTGATGAGTGGCCCAGGA 
CTCTATGACCCTACAACCATCATGAATGCAGATATTCTAGCATATTGTCAGAAGGAAGGATG 
GTGCAAATTAGCTTTTTATCTTCTAGCATTTTTTTACTACCTATATGGCATGATCTATGTTT 
TGGTGAGCTCTTAGAACAACACACAGAAGAATTGGTCCAGTTAAGTGCATGCAAAAAGCCAC 
CAAATGAAGGGATTCTATCCAGCAAGATCCTGTCCAAGAGTAGCCTGTGGAATCTGATCAGT 
TACTTTAAAAAATGACTCCTTATTTTTTAAATGTTTCCACATTTTTGCTTGTGGAAAGACTG 
TTTTCATATGTTATACTCAGATAAAGATTTTAAATGGTATTACGTATAAATTAATATAAAAT 
GATTACCTCTGGTGTTGACAGGTTTGAACTTGCACTTCTTAAGGAACAGCCATAATCCTCTG 
AATGATGCATTAATTACTGACTGTCCTAGTACATTGGAAGCTTTTGTTTATAGGAACTTGTA 
GGGCTCATTTTGGTTTCATTGAAACAGTATCTAATTATAAATTAGCTGTAGATATCAGGTGC 
TTCTGATGAAGTGAAAATGTATATCTGACTAGTGGGAAACTTCATGGGTTTCCTCATCTGTC 
ATGTCGATGATTATATATGGATACATTTACAAAAATAAAAAGCGGGAATTTTCCCTTCGCTT
GAATATTATCCCTGTATATTGCATGAATGAGAGATTTCCCATATTTCCATCAGAGTAATAAA 
TATACTTGCTTTAATTCTTAAGCATAAGTAAACATGATATAAAAATATATGCTGAATTACTT 
GTGAAGAATGCATTTAAAGCTATTTTAAATGTGTTTTTATTTGTAAGACATTACTTATTAAG 
AAATTGGTTATTATGCTTACTGTTCTAATCTGGTGGTAAAGGTATTCTTAAGAATTTGCAGG 
TACTACAGATTTTCAAAACTGAATGAGAGAAAATTGTATAACCATCCTGCTGTTCCTTTAGT 
GCAATACAATAAAACTCTGAAATTAAGACTC 



FIGURE 129 

</usr/seqdb2/sst/DNA/Dnaseqs.min/ss.DNA23330
<subunit 1 of 1, 144 aa, 1 stop
<MW: 16699, pI: 5.60, NX(S/T): 0

MAFTFAAFCYMLALLLTAALIFFAIWHIIAFDELKTDYKNPIDQCNTLNPLVLPEYLIHAFF
CVMFLCAAEWLTLGLNMPLLAYHIWRYMSRPVMSGPGLYDPTTIMNADILAYCQKEGWCKLA
FYLLAFFYYLYGMIYVLVSS

Important features:
Signal peptide: 
amino acids 1-20 

Type II transmembrane domain: 

amino acids 11-31 

Other transmembrane domain: 
amino acids 57-77 and 123-143 



FIGURE 130



ATTATAGCATTTGATGAGCTGAAGACTGATTACAAGATCCTATAGACCAGTGTAATACCCTG 
AATCCCCTTGTACTCCCAGAGTACCTCATCCACGCTTTCTTCTGTGTCATGTTTCTTTGTGC
AGCAGAGTGGCTTACACTGGGTCTCAATATGCCCCTCTTGGCATATCATATTTGGAGGTATA 
TGAGTAGACCAGTGATGAGTGGCCCAGGACTCTATGACCCTACAACCATCATGAATGCAGAT 
ATTCTAGCATATTGTCAGAAGGAAGGATGGTGCAAATTAGCTTTTTATCTTCTAGCATTTTT 
TTACTACCTATATGGCATGATCTATGTTTTGGTGAGCTCTTAGAACAACACACAGAAGAATT 
GGTCCAGTTAAGTGCATGCAAAAAGCCACCAAATGAAGGGATTCTATCCAGCAAGATCCTGT 
CCAAGAGTAGCCTGTGGAATCTGATCAGTTACTTTAAAAAATG 



FIGURE 131 



CGGACGCGTGGGGGAAACCCTTCCGAGAAAACAGCAACAAGCTGAGCTGCTGTGACAGAGGG
GAACAAGATGGCGGCGCCGAAGGGGAGCCTCTGGGTGAGGACCCAACTGGGGCTCCCGCCGC 
TGCTGCTGCTGACCATGGCCTTGGCCGGAGGTTCGGGGACCGCTTCGGCTGAAGCATTTGAC
TCGGTCTTGGGTGATACGGCGTCTTGCCACCGGGCCTGTCAGTTGACCTACCCCTTGCACAC 
CTACCCTAAGGAAGAGGAGTTGTACGCATGTCAGAGAGGTTGCAGGCTGTTTTCAATTTGTC 
AGTTTGTGGATGATGGAATTGACTTAAATCGAACTAAATTGGAATGTGAATCTGCATGTACA 
GAAGCATATTCCCAATCTGATGAGCAATATGCTTGCCATCTTGGTTGCCAGAATCAGCTGCC
ATTCGCTGAACTGAGACAAGAACAACTTATGTCCCTGATGCCAAAAATGCACCTACTCTTTC
CTCTAACTCTGGTGAGGTCATTCTGGAGTGACATGATGGACTCCGCACAGAGCTTCATAACC 
TCTTCATGGACTTTTTATCTTCAAGCCGATGACGGAAAAATAGTTATATTCCAGTCTAAGCC
AGAAATCCAGTACGCACCACATTTGGAGCAGGAGCCTACAAATTTGAGAGAATCATCTCTAA
GCAAAATGTCCTATCTGCAAATGAGAAATTCACAAGCGCACAGGAATTTTCTTGAAGATGGA
GAAAGTGATGGCTTTTTAAGATGCCTCTCTCTTAACTCTGGGTGGATTTTAACTACAACTCT
TGTCCTCTCGGTGATGGTATTGCTTTGGATTTGTTGTGCAACTGTTGCTACAGCTGTGGAGC
AGTATGTTCCCTCTGAGAAGCTGAGTATCTATGGTGACTTGGAGTTTATGAATGAACAAAAG
CTAAACAGATATCCAGCTTCTTCTCTTGTGGTTGTTAGATCTAAAACTGAAGATCATGAAGA
AGCAGGGCCTCTACCTACAAAAGTGAATCTTGCTCATTCTGAAATTTAAGCATTTTTCTTTT



AAAAGACAAGTGTAATAGACATCTAAAATTCCACTCCTCATAGAGCTTTTAAAATGGTTTCA 
TTGGATATAGGCCTTAAGAl^TCACTATAAAATGCAAATAAAGTTACTCAAATCTGTG 



FIGURE 132 



</usr/seqdb2/sst/DNA/Dnaseqs.min/ss.DNA26847
<subunit 1 of 1, 323 aa, 1 stop
<MW: 36223, pI: 5.06, NX(S/T): 1

MAAPKGSLWVRTQLGLPPLLLLTMALAGGSGTASAEAFDSVLGDTASCHRACQLTYPLHTYP
KEEELYACQRGCRLFSICQFVDDGIDLNRTKLECESACTEAYSQSDEQYACHLGCQNQLPFA
ELRQEQLMSLMPKMHLLFPLTLVRSFWSDMMDSAQSFITSSWTFYLQADDGKIVIFQSKPEI
QYAPHLEQEPTNLRESSLSKMSYLQMRNSQAHRNFLEDGESDGFLRCLSLNSGWILTTTLVL
SVMVLLWICCATVATAVEQYVPSEKLSIYGDLEFMNEQKLNRYPASSLVVVRSKTEDHEEAG
PLPTKVNLAHSEI

Important features: 
Signal peptide: 
amino acids 1-31 

Transmembrane domain: 

amino acids 241-260 

N-glycosylation site, 
amino acids 90-93 



FIGURE 133 



TTGGGTGATACGGCGTCTTGCCACCGGGCCTGTCAGTTGACCTACCCCTTGCACACCTACCC 
TAAGGAAGAGGAGTTGTACGCATGTCAGAGAGGTTGCAGGCTGTTTTCAATTTGTCAGTTTG 
TGGATGATGGAATTGACTTAAATCGAACTAAATTGGAATGTGAATCTGCATGTACAGAAGCA 
TATTCCCAATCTGATGAGCAATATGCTTGCCATCTTGGTTGCCAGAATCAGCTGCCATTCGC
TGAACTGAGACAAGAACAACTTATGTCCCTGATGCCAAAAATGCACCTACTCTTTCCTCTAA
CTCTGGTGAGGTCATTCTGGAGTGACATGATGGACTCCGC



FIGURE 134



O^CACTGGCCGGATCTTTTAGAGTCCTTTGACCTTGACCaAGGGTaJGGAAAACAGa^CAA 
GCTGAGCTGCTGTGACAGAGGGAACAAGATGGCGGCGCCGAAGGGAGCCTTTGGGTGAGGAC
CCAACTGGGGCTCCCGCCGCTGCTGCTGCTGACCATGGCCTTGGCCGGAGGTTCGGGGACCG 
CTTCGGCTGAAGCATTTGACTCGGTCTTGGGTGATACGGCGTCTTGCCACCGGGCCTGTCAG 
TTGACCTACCCCTTGCACACCTACCCTAAGGAAGAGGAGTTGTACGCATGTCAGAGAGGTTG 
CAGGCTGTTTTCAATTTGTCAGTTTGTGGATGATGGAATTGACTTAAATCGAACTAAATTGG 
AATGTGAATCTGCATGTACAGAAGCATATTCCCAATCTGATGAGCAATATGCTTGCCATCTT 
GGTTGCCAGAATCAGCTGCCATTCGCTGAACTGAGACAAGAACAACTTATGTCCCTGATGCC 
AAAAATGCACCTACTCTTTCCTCTAACTCTGGTGAGGTCATTCTGGAGTGACATGATGGACT 
CCGC 



FIGURE 135 



GCGAGGTGGCGATCGCTGAGAGGCAGGAGGGCCGAGGCGGGCCTGGGAGGCGGCCCGGAGGT 
GGGGCGCCGCTGGGGCCGGCCCGCACGGGCTTCATCTGAGGGCGCACGGCCCGCGACCGAGC 
GTGCGGACTGGCCTCCCAAGCGTGGGGCGACAAGCTGCCGGAGCTGCAATGGGCCGCGGCTG
GGGATTCTTGTTTGGCCTCCTGGGCGCCGTGTGGCTGCTCAGCTCGGGCCACGGAGAGGAGC 
AGCCCCCGGAGACAGCGGCACAGAGGTGCTTCTGCCAGGTTAGTGGTTACTTGGATGATTGT 
ACCTGTGATGTTGAAACCATTGATAGATTTAATAACTACAGGCTTTTCCCAAGACTACAAAA 
ACTTCTTGAAAGTGACTACTTTAGGTATTACAAGGTAAACCTGAAGAGGCCGTGTCCTTTCT 
GGAATGACATCAGCCAGTGTGGAAGAAGGGACTGTGCTGTCAAACCATGTCAATCTGATGAA 
GTTCCTGATGGAATTAAATCTGCGAGCTACAAGTATTCTGAAGAAGCCAATAATCTCATTGA 
AGAATGTGAACAAGCTGAACGACTTGGAGCAGTGGATGAATCTCTGAGTGAGGAAACACAGA 
AGGCTGTTCTTCAGTGGACCAAGCATGATGATTCTTCAGATAACTTCTGTGAAGCTGATGAC 
ATTCAGTCCCCTGAAGCTGAATATGTAGATTTGCTTCTTAATCCTGAGCGCTACACTGGTTA
CAAGGGACCAGATGCTTGGAAAATATGGAATGTCATCTACGAAGAAAACTGTTTTAAGCCAC
AGACAATTAAAAGACCTTTAAATCCTTTGGCTTCTGGTCAAGGGACAAGTGAAGAGAACACT
TTTTACAGTTGGCTAGAAGGTCTCTGTGTAGAAAAAAGAGCATTCTACAGACTTATATCTGG 
CCTACATGCAAGCATTAATGTGCATTTGAGTGCAAGATATCTTTTACAAGAGACCTGGTTAG 
AAAAGAAATGGGGACACAACATTACAGAATTTCAACAGCGATTTGATGGAATTTTGACTGAA 
GGAGAAGGTCCAAGAAGGCTTAAGAACTTGTATTTTCTCTACTTAATAGAACTAAGGGCTTT 
ATCCAAAGTGTTACCATTCTTCGAGCGCCCAGATTTTCAACTCTTTACTGGAAATAAAATTC 
AGGATGAGGAAAACAAAATGTTACTTCTGGAAATACTTCATGAAATCAAGTCATTTCCTTTG 
CATTTTGATGAGAATTCATTTTTTGCTGGGGATAAAAAAGAAGCACACAAACTAAAGGAGGA 
CTTTCGACTGCATTTTAGAAATATTTCAAGAATTATGGATTGTGTTGGTTGTTTTAAATGTC 
GTCTGTGGGGAAAGCTTCAGACTCAGGGTTTGGGCACTGCTCTGAAGATCTTATTTTCTGAG 
AAATTGATAGCAAATATGCCAGAAAGTGGACCTAGTTATGAATTCCATCTAACCAGACAAGA 
AATAGTATCATTATTCAACGCATTTGGAAGAATTTCTACAAGTGTGAAAGAATTAGAAAACT
TCAGGAACTTGTTACAGAATATTa^lTAaAGAAAACAAGCTGAT^ 

AATGGAGGCGAAAGAGTGGAATTTCATTCAAAGGCATAATAGCAATGACAGTCTTAAGCCAA 
ACATTTTATATAAAGTTGCTTTTGTAAAGGAGAATTATATTGTTTTAAGTAAACACATTTTT 
AAAAATTGTGTTAAGTCTATGTATAATACTACTGTGAGTAAAAGTAATACTTTAATAATGTG 
GTACAAATTTTAAAGTTTAATATTGAATAAAAGGAGGATTATCAAATTAAAAAAAAAAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAA 



FIGURE 136 

</usr/seqdb2/sst/DNA/Dnaseqs.min/ss.DN2^3974 
<subunit 1 of 1, 468 aa, 1 stop
<MW: 54393, pI: 5.63, NX(S/T): 2

MGRGWGFLFGLLGAVWLLSSGHGEEQPPETAAQRCFCQVSGYLDDCTCDVETIDRFNNYRLF
PRLQKLLESDYFRYYKVNLKRPCPFWNDISQCGRRDCAVKPCQSDEVPDGIKSASYKYSEEA
NNLIEECEQAERLGAVDESLSEETQKAVLQWTKHDDSSDNFCEADDIQSPEAEYVDLLLNPE
RYTGYKGPDAWKIWNVIYEENCFKPQTIKRPLNPLASGQGTSEENTFYSWLEGLCVEKRAFY
RLISGLHASINVHLSARYLLQETWLEKKWGHNITEFQQRFDGILTEGEGPRRLKNLYFLYLI
ELRALSKVLPFFERPDFQLFTGNKIQDEENKMLLLEILHEIKSFPLHFDENSFFAGDKKEAH
KLKEDFRLHFKNISRIMDCVGCFKCRLWGKLQTQGLGTALKILFSEKLIANMPESGPSYEFH
LTRQEIVSLFNAFGRISTSVKELENFRNLLQNIH

Important features: 
Signal peptide: 
amino acids 1-23 

N-glycosylation site. 

amino acids 280-283 and 384-387 

Amidation site, 
amino acids 94-97 

Glycosaminoglycan attachment site. 

amino acids 20-23 and 223-226 

Aminotransferases class-V pyridoxal-phosphate

amino acids 216-222 

Interleukin-7 proteins 
amino acids 338-343 



FIGURE 137 



GCTGGAAATATGGATGTCATCTACGAGAAACTGTTTTAAGCCACAGACAATTAAAAGACCTT 
TAAATCCTTTGGCTTCTGGTCAAGGGACAAGTGAAGAGNACACTTTTTACAGTTGGCTAGAA
GGTCTCTGTGTAGAAAAAAGAGCATTCTACAGACTTATATCTGGCCTACATGCAAGCATTAA 
TGTGCATTTGAGTGCAAGATATCTTTTACAAGAGACCTGGTTAGAAAAGAAATGGGGACACA 
ACATTACAGAATTTNAACAGCGATTTGATGGAATTTTGACTGAAGGAGAAGGTCCAAGAAGG 
CTTAAGAACTTGTATTTTCTCTACTTAATAGAACTAAGGGCTTTATCCAAAGTGTTACCATT 
CTTNGAGCGCCCAGATTTTCAACTNTTTACTGGAAATAAAATTCAGGATGAGGNAAACAAAA
TGTTACTTTTGGAAATACTTCATGAAATCAAGTCATTTCCTTTGCATTTTGATGAGAATTCA 
TTTTTTTGCTG 



FIGURE 138 



CGGACGCGTGGGCGGACGCGTGGGCGGACGCGTGGGTTGGGAGGGGGCAGGATGGGAGGGAA 
AGTGAAGAAAACAGAAAAGGAGAGGGACAGAGGCCAGAGGACTTCTCATACTGGACAGAAAC 
CGATCAGGCATGGAACTCCCCTTCGTCACTCACCTGTTCTTGCCCCTGGTGTTCCTGACAGG
TCTCTGCTCCCCCTTTAACCTGGATGAACATCACCCACGCCTATTCCCAGGGCCACCAGAAG 
CTGAATTTGGATACAGTGTCTTACAACATGTTGGGGGTGGACAGCGATGGATGCTGGTGGGC 
GCCCCCTGGGATGGGCCTTCAGGCGACCGGAGGGGGGACGTTTATCGCTGCCCTGTAGGGGG 
GGCCCACAATGCCCCATGTGCCAAGGGCCACTTAGGTGACTACCAACTGGGAAATTCATCTC 
ATCCTGCTGTGAATATGCACCTGGGGATGTCTCTGTTAGAGACAGATGGTGATGGGGGATTC 
ATGGTGAGCTAAGGAGAGGGTGGTGGCAGTGTCTCTGAAGGTCCATAAAAGAAAAAAGAGAA
GTGTGGTAAGGGAAAATGGTCTGTGTGGAGGGGTCAAGGAGTTAAAAACCCTAGAAAGCAAA 
AGGTAGGTAATGTCAGGGAGTAGTCTTCATGCCTCCTTCAACTGGGAGCATGTTCTGAGGGT 
GCCCTCCCAAGCCTGGGAGTAACTATTTCCCCCATCCCCAGGCCTGTGCCCCTCTCTGGTCT 
CGTGCTTGTGGCAGCTCTGT(m:<::aGTTCrGGGATATGTGCCC6TGTGGATGCTTCATTCC::A 
GCCTCAGGGAAGCCTGGCACCCACTGCCCAACGTGAGCCAGAGGAAGGCTGAGTACTTGGTT 
CCCAGAAGGAGATACTGGGTGGGAAAAAGATGGGGCAAAGCGGTATGATGCCTGGCAAAGGG
CCTGCATGGCTATCCTCATTGCTACCTAATGTGCTTGCAAAAGCTCCATGTTTCCTAACAGA 
TTCAGACTCCTGGCCAGGTGTGGTGGCCCACACCTGTAATTCTAGCACTTTGGGAGGCCAAG 
GTGGGCAGATCACTTGAGGTCAGGAGTTCAAGACCAGCCTGGCCAACATGGTGAAACTCCAT 
CTCTACTAAAAAAAAA?^AAATACAA2^AATTAGCTGGGTGCGCTAGTGCATGCCTGTAATCTC 
ATCTACTCGGGAGGCTAAGACAGGAGACTCTCACTTCAACCCAGGAGGTGGAGGTTGCGGTG 
AGCCAAGATTGTGCCTCTGCACTCTAGCGTGGGTGACAGAGTAAGCGAGACTCCATCTCAAA 
AATAATAATAATAATAATTCAGACTCCTTATCAGGAGTCCATGATCTGGCCTGGCACAGTAA 
CTCATGCCTGTAATCCCAACATTTTGGGAGGCCAACGCAGGAGGATTGCTTGAGGTCTGGAG
GTTTGAGACO^GCCTGGGCAACS^TAGAAAGACCCCATCTCTAAATAAATGTTTTAAAAAT 



FIGURE 139 

</usr/seqdb2/sst/DNA/Dnaseqs.min/ss.DNA57039
<subunit 1 of 1, 124 aa, 1 stop
<MW: 13352, pI: 5.99, NX(S/T): 1

MELPFVTHLFLPLVFLTGLCSPFNLDEHHPRLFPGPPEAEFGYSVLQHVGGGQRWMLVGAPW 
DGPSGDRRGDVYRCPVGGAHNAPCAKGHLGDYQLGNSSHPAVNMHLGMSLLETDGDGGFMVS

Important features: 
Signal peptide: 
amino acids 1-22 

Cell attachment sequence. 

amino acids 70-73 

N-glycosylation site. 

amino acids 98-101 

Integrins alpha chain proteins 

amino acids 67-81 



FIGURE 140 



CT^CAGTTCCCCACCATCACTCan'CCa^TTCCTTCCAACTTTATTTTTAGCTTGCCA^ 

GGGGGCAGGATGGGAGGGAAAGTGAAGAAAACAGAAAAGGAGAGGGACAGAGGCCAGAGGAC 

TTCTCATACTGGACAGAAACCGATCAGGCATGGAACTCCCCTTCGTCACTCACCTGTTCTTG 

CCCCTGGTGTTCCTGACAGGTCTCTGCTCCCCCTTTAACCTGGATGAACATCACCCACGCCT 

ATTCCCAGGGCGACG^GAAGCTQAATTTGGATACAGTGTCTTACAACATGTTGGGGGTC^ 

AGCGATGGATGCTGGTGGGCGCCCCCTGGGATGGGCCTTCAGGCGACCGGAGGGGGGACGTT 

TATCGCTGCCCTGTAGGGGGGGCCCACAATGCCCCATGTGCCAAGGGCCACTTAGGTGACTA 

CCAACTGGGAAATTCATCTCATCCTGCTGTGAATATGCACCTGGGGATGTCTCTGTTAGAGA 

CAGATGGTGATGG 



FIGURE 141 

AAAGTTACATTTTCTCTGGAACTCTCCTAGGCCACTCCCTGCTGATGCAACATCTGGGTTTG 
GGCAGAAAGGAGGGTGCTTCGGAGCCCGCCCTTTCTGAGCTTCCTGGGCCGGCTCTAGAACA 
ATTCAGGCTTCGCTGCGACTCAGACCTCAGCTCCAACATATGCATTCTGAAGAAAGATGGCT 
GAGATGGACAGAATGCTTTATTTTGGAAAGAAACAATGTTCTAGGTCAAACTGAGTCTACCA 
AATGCAGACTTTCACAATGGTTCTAGAAGAAATCTGGACAAGTCTTTTCATGTGGTTTTTCT 
ACGCATTGATTCCATGTTTGCTCACAGATGAAGTGGCCATTCTGCCTGCCCCTCAGAACCTC 
TCTGTACTCTCAACCAACATGAAGCATCTCTTGATGTGGAGCCCAGTGATCGCGCCTGGAGA
AACAGTGTACTATTCTGTCGAATACCAGGGGGAGTACGAGAGCCTGTACACGAGCCACATCT 
GGATCCCCAGCAGCTGGTGCTCACTCACTGAAGGTCCTGAGTGTGATGTCACTGATGACATC 
ACGGCCACTGTGCCATACAACCTTCGTGTCAGGGCCACATTGGGCTCACAGACCTCAGCCTG 
GAGCATCCTGAAGCATCCCTTTAATAGAAACTCAACCATCCTTACCCGACCTGGGATGGAGA 
TCACCAAAGATGGCTTCCACCTGGTTATTGAGCTGGAGGACCTGGGGCCCCAGTTTGAGTTC 
CTTGTGGCCTACTGGAGGAGGGAGCCTGGTGCCGAGGAACATGTCAAAATGGTGAGGAGTGG 
GGGTATTCCAGTGCACCTAGAAACCATGGAGCCAGGGGCTGCATACTGTGTGAAGGCCCAGA 
CATTCGTGAAGGCCATTGGGAGGTACAGCGCCTTCAGCCAGACAGAATGTGTGGAGGTGCAA 
GGAGAGGCCATTCCCCTGGTACTGGCCCTGTTTGCCTTTGTTGGCTTCATGCTGATCCTTGT 
GGTCGTGCCACTGTTCGTCTGGAAAATGGGCCGGCTGCTCCAGTACTCCTGTTGCCCCGTGG
TGGTCCTCCCAGACACCTTGAAAATAACCAATTCACCCCAGAAGTTAATCAGCTGCAGAAGG
GAGGAGGTGGATGCCTGTGCCACGGCTGTGATGTCTCCTGAGGAACTCCTCAGGGCCTGGAT 
CTCATAGGTTTGCGGAAGGGCCCAGGTGAAGCCGAGAACCTGGTCTGCATGACATGGAAACC 
ATGAGGGGACAAGTTGTGTTTCTGTTTTCCGCCACGGACAAGGGATGAGAGAAGTAGGAAGA 
GCCTGTTGTCTACAAGTCTAGAAGCAACCATCAGAGGCAGGGTGGTTTGTCTAACAGAACAC 
TGACTGAGGCTTAGGGGATGTGACCTCTAGACTGGGGGCTGCCACTTGCTGGCTGAGCAACC 
CTGGGAAAAGTGACTTCATCCCTTCGGTCCTAAGTTTTCTCATCTGTAATGGGGGAATTACC 
TACACACCTGCTAAACACACACACACAGAGTCTCTCTCTATATATACACACGTACACATAAA 
TACACCCAGCACTTGCAAGGCTAGAGGGAAACTGGTGACACTCTACAGTCTGACTGATTCAG 
TGTTTCTGGAGAGCAGGACATAAATGTATGATGAGAATGATCAAGGACTCTACACACTGGGT 
GGCTTGGAGAGCCCACTTTCCCAGAATAATCCTTGAGAGAAAAGGAATCATGGGAGCAATGG 
TGTTGAGTTCACTTCAAGCCCAATGCCGGTGCAGAGGGGAATGGCTTAGCGAGCTCTACAGT 
AGGTGACCTGGAGGAAGGTCACAGCCACACTGAAAATGGGATGTGCATGAACACGGAGGATC 
CATGAACTACTGTAAAGTGTTGACAGTGTGTGCACACTGCAGACAGCAGGTGAAATGTATGT 
GTGCAATGCGACGAGAATGCAGAAGTCAGTAACATGTGCATGTTTGTTGTGCTCCTTTTTTC 
TGTTGGTAAAGTACAGJ^TTCAGCAAATAAAAAGGGCCACCCTGGCCAAAAGCGGTAAAAAA 
AAAAAAAAAA 



FIGURE 142
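
The correspondence between the nucleotide sequence of Figure 142 and the amino acid sequence of Figure 143 can be checked by translating the open reading frame with the standard genetic code. The following Python sketch is illustrative only and is not part of the original disclosure: the function name `translate_orf` is hypothetical, and the 60-nucleotide fragment is transcribed from Figure 142 on the assumption that the ATG shown there encodes the initiating methionine.

```python
# Standard genetic code, with codons enumerated in the base order T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate_orf(dna, start=0):
    """Translate codons from `start` (0-based) up to the first stop codon.

    Codons containing N or any other non-ACGT character are reported as X.
    """
    protein = []
    for pos in range(start, len(dna) - 2, 3):
        residue = CODON_TABLE.get(dna[pos:pos + 3].upper(), "X")
        if residue == "*":  # stop codon ends the reading frame
            break
        protein.append(residue)
    return "".join(protein)


# Assumed fragment: first 20 codons of the Figure 142 reading frame.
FIG142_FRAGMENT = "ATGCAGACTTTCACAATGGTTCTAGAAGAAATCTGGACAAGTCTTTTCATGTGGTTTTTC"

if __name__ == "__main__":
    print(translate_orf(FIG142_FRAGMENT))
```

Under these assumptions the fragment translates to the first twenty residues listed in Figure 143 (MQTFTMVLEEIWTSLFMWFF).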

</usr/seqdb2/sst/DNA/Dnaseqs.min/ss.DNA57033
<subunit 1 of 1, 311 aa, 1 stop
<MW: 35076, pI: 5.04, NX(S/T): 2

MQTFTMVLEEIWTSLFMWFFYALIPCLLTDEVAILPAPQNLSVLSTNMKHLLMWSPVIAPGE 
TVYYSVEYQGEYESLYTSHIWIPSSWCSLTEGPECDVTDDITATVPYNLRVRATLGSQTSAW
SILKHPFNRNSTILTRPGMEITKDGFHLVIELEDLGPQFEFLVAYWRREPGAEEHVKMVRSG 
GIPVHLETMEPGAAYCVKAQTFVKAIGRYSAFSQTECVEVQGEAIPLVLALFAFVGFMLILV
VVPLFVWKMGRLLQYSCCPVVVLPDTLKITNSPQKLISCRREEVDACATAVMSPEELLRAWIS 

Important features: 
Signal peptide: 
amino acids 1-29 

Transmembrane domain:
amino acids 230-255 

N-glycosylation site. 

amino acids 40-43 and 134-137 

Tissue factor proteins, 
amino acids 92-119 



Integrins alpha chain proteins

amino acids 232-262 



FIGURE 143 
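
The N-glycosylation sites listed with the amino acid figures are reported as four-residue windows, which is consistent with the PROSITE PS00001 pattern N-{P}-[ST]-{P}. The following Python sketch is illustrative only and is not part of the original disclosure: the helper name `find_sequons` is hypothetical, and treating the listed windows as PS00001 matches is an assumption. It scans the first 62 residues of the Figure 143 sequence.

```python
import re

# PROSITE PS00001: N, then any residue except P, then S or T, then any residue except P.
# The lookahead lets overlapping candidate windows be reported independently.
SEQUON = re.compile(r"(?=(N[^P][ST][^P]))")


def find_sequons(protein):
    """Return 1-based inclusive (start, end) windows matching N-{P}-[ST]-{P}."""
    return [(m.start() + 1, m.start() + 4) for m in SEQUON.finditer(protein.upper())]


# Assumed input: the first line (residues 1-62) of the Figure 143 sequence.
FIG143_FIRST_LINE = "MQTFTMVLEEIWTSLFMWFFYALIPCLLTDEVAILPAPQNLSVLSTNMKHLLMWSPVIAPGE"

if __name__ == "__main__":
    print(find_sequons(FIG143_FIRST_LINE))
```

For this input the scan reports the single window 40-43, matching the first N-glycosylation site listed for Figure 143; the second listed site (134-137) lies beyond the 62 residues used here.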



TCCTGCTGATGCAO^TCTGGGTTTGGCAAAAGGAGGTTGCTTCGAGCCGCCCTTTCTAGCTT 
CCTGGCCGGCTCTAGAACAATTCAGGCTTCGCTGCGACTAGACCTCAGCTCCAACATATGCA 
TTCTGAAGAAAGATGGCTGAGATGACAGAATGCTTTATTTTGGAAAGAAACAATGTTCTAGG 
TCAAACTGAGTCTACCAAATGCAGACTTTCACAATGGTTCTAGAAGAAATCTGGACAAGTCT 
TTTCATGTGGTTTTTCTACGCATTGATTCCATGTTTGCTCACAGATGAAGTGGCCATTCTGC 
CTGCCCCTCAGAACCTCTCTGTACTCTCAACCAACATGAAGCATCTCTTGATGTGGAGCCCA 
GTGATCGCGCCTGGAGAAACAGTGTACTATTCTGTCGAATACCAGGGGGAGTACGAGAGCCT 
GTACACGAGCCACATCTGGATCCCCAGCAGCTGGTGCTCACTCACTGAAGGTCCTGAGTGTG 
ATGTCACTGATGACATCACGGCCACTGTGCCATACAACCTTTGTGTCAGGGCCACATTGGGC 
TCACAGACCTCAGCCTGGAGCATCCTGAAGCATCCCTTTAATAGAAACTCAACCATCCTTAC 
CCGACCTGGGATGGAGATCACCAAAGATGGCTTNCACCTGGTTATTGAGCTGGAGGACCTGG 
GGCCCCAGTTTGAGTTCCTTGTGGCCTANTGGAGGAGGGGCGAACCCCTTGCGGCGCAAGGG 
GTTNGCGAACCCCTTGCGGCCGCTGGGGTATCTCTCGAGAAAAGAGAGGCCCAATATGACCC 
ACATACTCAATATGGACGAAlSrrGCTATTGTCCACCTGTTTGAGTGGCGCrGGGTTGA 



FIGURE 144 



CCCACGCGTCCGCCCACGCGTCCGAGGGACAAGAGAGAAGAGAGACTGAAACAGGGAGAAGA 
GGCAGGAGAGGAGGAGGTGGGGAGAGCACGAAGCTGGAGGCCGACACTGAGGGAGGGCGGGA
GGAGGTGAAGAAGGAGAGAGGGGAGAAGAGGCAGGAGCTGGAAAGGAGAGAGGGAGGAGGAG
GAGGAGATGCGGGATGGAGACCTGGAGTTAGGTGGCTTGGGAGAGCTTAATGAAAAGAGAAC
GGAGAGGAGGTGTGGGTTAGGAACCAAGAGGTAGCCCTGTGGGCAGCAGAAGGCTGAGAGGA 
GTAGGAAGATCAGGAGCTAGAGGGAGACTGGAGGGTTCCGGGAAAAGAGCAGAGGAAAGAGG 
AAAGACACAGAGAGACGGGAGAGAGAAGAAGAGTGGGTTTGAAGGGCGGATCTCAGTCCCTG 
GCTGCTTTGGCATTTGGGGAACTGGGACTCCCTGTGGGGAGGAGAGGAAAGCTGGAAGTCCT 
GGAGGGACAGGGTCCCAGAAGGAGGGGACAGAGGAGCTGAGAGAGGGGGGCAGGGCGTTGGG 
CAGGGGTCCCTCGGAGGCCTCCTGGGGATGGGGGCTGCAGCTCGTCTGAGCGCCCCTCGAGC 
GCTGGTACTCTGGGCTGCACTGGGGGCAGCAGCTCACATCGGACCAGCACCTGACCCCGAGG 
ACTGGTGGAGCTACAAGGATAATCTCCAGGGAAACTTCGTGCCAGGGCCTCCTTTCTGGGGC 
CTGGTGAATGCAGCGTGGAGTCTGTGTGCTGTGGGGAAGCGGCAGAGCCCCGTGGATGTGGA 
GCTGAAGAGGGTTCTTTATGACCCCTTTCTGCCCCCATTAAGGCTCAGCACTGGAGGAGAGA 
AGCTCCGGGGAACCTTGTACAACACCGGCCGACATGTCTCCTTCCTGCCTGCACCCCGACCT 
GTGGTCAATGTGTCTGGAGGTCCCCTCCTTTACAGCCACCGACTCAGTGAACTGCGGCTGCT 
GTTTGGAGCTCGCGACGGAGCCGGCTCGGAACATCAGATCAACCACCAGGGCTTCTCTGCTG 
AGGTGCAGCTCATTCACTTCAACCAGGAACTCTACGGGAATTTCAGCGCTGCCTCCCGCGGC 
CCCAATGGCCTGGCCATTCTCAGCCTCTTTGTCAACGTTGCCAGTACCTCTAACCCATTCCT 
CAGTCGCCTCCTTAACCGCGACACCATCACTCGCATCTCCTACAAGAATGATGCCTACTTTC 
TTCAAGACCTGAGCCTGGAGCTCCTGTTCCCTGAATCCTTCGGCTTCATCACCTATCAGGGC 
TCTCTCAGCACCCCGCCCTGCTCCGAGACTGTCACCTGGATCCTCATTGACCGGGCCCTCAA 
TATCACCTCCCTTCAGATGCACTCCCTGAGACTCCTGAGCCAGAATCCTCCATCTCAGATCT 
TCCAGAGCCTCAGCGGTAACAGCCGGCCCCTGCAGCCCTTGGCCCACAGGGCACTGAGGGGC 
AACAGGGACCCCCGGCACCCCGAGAGGCGCTGCCGAGGCCCCAACTACCGCCTGCATGTGGA 
TGGTGTCCCCCATGGTCGCTGAGACTCCCCTTCGAGGATTGCACCCGCCCGTCCTAAGCCTC 
CCCACAAGGCGAGGGGAGTTACCCCTAAAACAAAGCTATTAAAGGGACAQAATACTTA 



FIGURE 145 



</usr/seqdb2/sst/DNA/Dnaseqs.min/ss.DNA34353 
<subunit 1 of 1, 328 aa, 1 stop 
<MW: 36238, pI: 9.90, NX(S/T): 3 

MGAAARLSAPRALVLWAALGAAAHIGPAPDPEDWWSYKDNLQGNFVPGPPFWGLVNAAWSLC 
AVGKRQSPVDVELKRVLYDPFLPPLRLSTGGEKLRGTLYNTGRHVSFLPAPRPVVNVSGGPL 
LYSHRLSELRLLFGARDGAGSEHQINHQGFSAEVQLIHFNQELYGNFSAASRGPNGLAILSL 
FVNVASTSNPFLSRLLNRDTITRISYKNDAYFLQDLSLELLFPESFGFITYQGSLSTPPCSE 
TVTWILIDRALNITSLQMHSLRLLSQNPPSQIFQSLSGNSRPLQPLAHRALRGNRDPRHPER 
RCRGPNYRLHVDGVPHGR 

Important features: 
Signal peptide: 

amino acids 1-23 

Transmembrane domain: 

amino acids 177-199 

N-glycosylation site. 

amino acids 118-121, 170-173 and 260-263 

Eukaryotic-type carbonic anhydrases proteins 

amino acids 222-270, 128-164 and 45-92 
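
The N-glycosylation positions listed above for the 328-residue sequence can be cross-checked with a short script. The sketch below assumes the common N-X-S/T sequon definition (X being any residue except proline) and reports a four-residue window starting at each asparagine, which appears to be the convention used in the feature list. The function name and the regular expression are illustrative and are not part of the original analysis.

```python
import re

# Protein sequence copied from the 328-residue listing above.
SEQ = (
    "MGAAARLSAPRALVLWAALGAAAHIGPAPDPEDWWSYKDNLQGNFVPGPPFWGLVNAAWSLC"
    "AVGKRQSPVDVELKRVLYDPFLPPLRLSTGGEKLRGTLYNTGRHVSFLPAPRPVVNVSGGPL"
    "LYSHRLSELRLLFGARDGAGSEHQINHQGFSAEVQLIHFNQELYGNFSAASRGPNGLAILSL"
    "FVNVASTSNPFLSRLLNRDTITRISYKNDAYFLQDLSLELLFPESFGFITYQGSLSTPPCSE"
    "TVTWILIDRALNITSLQMHSLRLLSQNPPSQIFQSLSGNSRPLQPLAHRALRGNRDPRHPER"
    "RCRGPNYRLHVDGVPHGR"
)

# Assumed sequon: N, then any residue except P, then S or T.
# The lookahead lets overlapping motifs be reported as well.
SEQUON = re.compile(r"(?=N[^P][ST])")


def n_glycosylation_sites(seq):
    """Return 1-based (start, end) windows of four residues, one per sequon."""
    return [(m.start() + 1, m.start() + 4) for m in SEQUON.finditer(seq)]


if __name__ == "__main__":
    print(len(SEQ))                    # sequence length
    print(n_glycosylation_sites(SEQ))  # candidate N-glycosylation windows
```

Run on the sequence as printed, the scan finds three sequons, at 118-121, 170-173 and 260-263, in agreement with the NX(S/T) count of 3 in the header.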



FIGURE 146 



GGCGCCTGGTTCTGCGCGTACTGGCTGTACGGAGCAGGAGCaAGAGGTCGCCGCCAGCCTCCGCCGCCGAGCCTC 
GTTCGTGTCCCCGCCCCTCGCTCCTGCAGCTACTGCTCAGAAACGCTGGGGCGCCCACCCTGGCAGACTAACGSiA 
GCAGCTCCCTTCCCa^CCCCAACTGC7«MTCTJ!^TTTTGGAa3CTTTGCCT 
CX3CaGAGGCGGAGGOTCGCGTATTCCTGCaGTC3^GCaCCCACGTC6^^ 

GCGAGCGGGGCTCTCCGTCTGCGGTCCCTTCTGAAGGCTCTGGGCGGCroCSVGAGGCCGGCTO 

CACCTCrCCCAGGAAACTTCACaiCTGGAGAGCCAAAAGGAGTGGAAGAGCCrGTCTTGGAGATTTTCCT^^ 

ATCCTGAGGTCa.TTCAT TATGa AGTGTACCGCGCGGGAGTGQCTCAGAGTAACCACaGTGCTGTTCATGGCTAGA 

GCaATTCaVGCCTVTGGTGGTTCCCaATGCCACTTTATTGGAGAAACTTTTGGAAAAATACATGGATGAGGATGGT 

GAGTGGTGGATAGCCAAACAACGAGGGAAAAGGGCCATCACAGACAATGACT^TGCAGAGTATTTT 

AATAAATTACGAAGTCAGGTGTATCCAACAGCCTCTAATATGGAGTATATGACATGGQATGTAGAGCTGGAAAGA 

TCTGCaiGAATCCTGGGCTGAAAGTTGCTTGTGGGAACaTGGACCrGCAAGCTTGCTTCCATCAATTGGACAGAAT 

TTGGGAGa^CACTGGGGAAGATATAGGCCCCCGAOSTTTCATGTAa^TCGTGGTATGATGAJ^ 

CaGGTCGTGTGGGO^CTAGTAACAGftATCXSGTTGTGCCATTAATTTGTO^ 
ATATGGCCXakRAGCTGTCTACCTGGTGTGCaATTACTCCCCaAAGGGAAACnX^ 
CaTGGGCXSGCCCTGTTCTGCTTGCCCACCrAGTTTTGGAGGGGGCTGTAGAGAAAATCTGTC 
TCaGAa«3GTATTATCCCCCTCGAGAAGAGGAAACaAATGAAATAG^ 

aiTGTCOSGACa^AGATCAGATGATAGTAGCAGAAATGAAGTCATAAGCGCACAGCAAATGTCCCT^ 

TGTGAAGTAAGATTAAGAGATCaGTGCAAAGGAACAACCTGCaATAGGTACGAATGTCCTGCTGGCTGTTTGGAT 

AGTAAAGCTAAAGTTATTGGCa.GTGTACATTATGAAATGCAATCCAGCATCTGTAGAGCTGCAATTCATTATG^ 

ATAATAGAO^TGATGGTGGCTGGGTAGATATCACTAGAO^GGAAGAAAGCATTATTTCATCaAGTCC^ 

AATGGTATTCAAAOUlTTGGCAAATATa^GTCTGCTAATTCXTTCSiaVGTCTCTA^ 

ACTTGTGAAACaACTGTGGAACS^GCTCrGTCCATTTCATAAGCCTGCTTCACaTTG^ 

CGTAACTGTATGO^GCAAATCCaCATTATGCrCGTGTAATTGGAACTOSAGTTT^^ 

TGCAGAGCAGCAGTACATGCTGGAGTGGTTai?^TCAC53GTGGTTATGTTGATG 

AAGACCTACATTGCTTCOTrrTCaUSaATGGAATCTTCTCaGAAAGT^ 

ATTTTGTATAAAACTGTAACATTAOTGTACAGAGTAaVTCa^^^ 

TAAATCTTGATAAACaAAGTCTATAAAATAAAACATGGGACaTTAGCrTTGGGAAAAGTAAT6?iAAATATAATOT 

TTTTAGAAATCCTGTGTTAAATATTGCTATATTTTCTTAGCAGTTATTTCTAaVGTTAATTAa^TAGTCATGAl^ 

GTTCTACGTTTCATATATTATATGGTGCTTTGTATATGCCACTAATAAAATGAATCTAAACATTGAATGTGAATG 

GCCCTG^GAAAATCATCTAGTGa\.TTTAAA2\ATAATCGACTCTAAAACTGAAAGAAACCTTATCACATTTTC^ 

AGTTCaUVTGCTATGCCaTTACCS^CTCCAAATAATCTCaaATAAT^ 

TOTTAATTTAGGOVTATAGAATATTAAATTCTGATATTGCACTTCrTAT^^ 

CaAATGAATCTGTTAAAATGTTTGATTCCTTGGGftATGGCCTTAAAAATAAATGTAATAAAGTC^^ 

ATGAAAACATTCCTAGTGATCaTGTAGTAAATGTAGGGTTAAGCATGG^ 

AAATTGAGGTCACATATTTTCTTTTGTATCCrGGa^AATACTCCTGCaGGCCAGGZ^ 

GAACAAAGATGAACTAATGTATTACATTACCaTTGCCaCTGATTTTTTTTJ^ 

ATATTGCCATATCATGGTACCTATAATGGTGATATATTTGTTTCTATGAAAAATGTATTGTOCTT^ 

AATCTGTAAAATGTTAGTTTTGGTAATTTTTTTTCTGCTGGTGGATTTAakTATTA^ 

TAAACATTAAAATTAATCATGTTTCaAAaAAAAAAAaA 

</usr/seqdb2/sst/DNA/Dnaseqs.min/ss.DNA45417 
<subunit 1 of 1, 500 aa, 1 stop 
<MW: 56888, pI: 8.53, NX(S/T): 2 

MKCTAREWLRVTTVLFMARAIPAMVVPNATLLEKLLEKYMDEDGEWW^ 

QSILDLHNKLRSQVYPTASNMEYMTWDVELERSAESWAESCLWEHGPASLLPSIGQNLGAHW 
GRYRPPTFHVQSWYDEVKDFSYPYEHECNPYCPFRCSGPVCTHYTQWWATSNRIGCAINLC 
HNMNIWGQIWPKAVYLVCNYSPKGNWWGHAPYKHGRPCSACPPSFGGGCRENLCYKEGSDRY 
YPPREEETNEIERQQSQVHDTHVRTRSDDSSRNEVISAQQMSQIVSCEVRLRDQCKGTTCNR 
YECPAGCLDSKAKVIGSVHYEMQSSICRAAIHYGIIDNDGGWVDITRQGRKHYFIKSNRNGI 
QTIGKYQSANSFWSKWVQAVTCETTVEQLCPFHKPASHCPRVYCPRNCMQANPHYARVIG 
TRVYSDLSSICRAAVHAGWRNHGGYVDVMPVDKRKTYIASFQNGIFSESLQNPPGGKAFRV 
FAW 

Important features: 
Signal peptide: 

amino acids 1-20 

Extracellular proteins SCP/Tpx-l/Ag5/PR-l/Sc7 protein 
amino acids 165-186, 196-218, 134-146, 96-108 and 58-77 

N-glycosylation site 
amino acids 28-31 

GCGGAGACAAGCGCAGAGCGCAGCGCACGGCCACAGACAGCCCTGGGCATCCACCGACGGCG 
CAGCCGGAGCCAGCAGAGCCGGAAGGCGCGCCCCGGGCAGAGAAAGCCGAGCAGAGCTGGGT 
GGCGTCTCCGGGCCGCCGCTCCGACGGGCCAGCGCCCTCCCCATGTCCCTGCTCCCACGCCG 
CGCCCCTCCGGTCAGCATGAGGCTCCTGGCGGCCGCGCTGCTCCTGCTGCTGCTGGCGCTGT 
ACACCGCGCGTGTGGACGGGTCCiy^TGCAAGTGCTCCCGGAAGGGACCCAAGATCCGCTAC 
AGCGACGTGAAGAAGCTGGAAATGAAGCCAAAGTACCCGCACTGCGAGGAGAAGATGGTTAT 
CATCACCACCAAGAGCGTGTCCAGGTACCGAGGTCAGGAGCACTGCCTGCACCCCAAGCTGC 
AGAGCACCAAGCGCTTCATCAAGTGGTACAACGCCTGGAACGAGAAGCGCAGGGTCTACGAA 
GAATAGGGTGAAAAACCTCAGAAGGGAAAACTCCAAACCAGTTGGGAGACTTGTGCAAAGGA 
CTTTGCAGATTAAAAAAAAAAAAAAAAAAAAAAAA2^A2^AAAAAAAAAAAAA2^AAAGCCTTTC 
TTTCTCAGAGGCai.TAAGACACA2^TTATATATTGTTATGAAGCACTTTTTACCAACGGTCAG 
TTTTTACATTTTATAGCTGCGTGCGAAAGGCTTCCAGATGGGAGACCCATCTCTCTTGTGCT 
CCAGACTTCATCACAGGCTGCTTTTTATCAZ^AAAGGGQAAAACTCATGCCTTTCCTTTTTAA 
AAAATGCTTTTTTGTATTTGTCCATACGTCACTATACATCTGAGCTTTATAAGCGCCCGGGA 
GGAAC^TGAGCTTGGTGGACACATTTCATTGCAGTGTTGCTCCATTCCTAGCTTGGGAAGC 
TTCCGCTTAGAGGTCCTGGCGCCTCGGCACAGCTGCCACGGGCTCTCCTGGGCTTATGGCCG 
GTCACAGCCTCAGTGTGACTCCACAGTGGCCCCTGTAGCCGGGCAAGCAGGAGCAGGTCTCT 
CTGCATCTGTTCTCTGAGGAACTCAAGTTTGGTTGCCAGAAAAATGTGCTTCATTCCCCCCT 
GGTTAATTTTTACACACCCTAGGAAACATTTCCAAGATCCTGTGATGGCGAGACAAATGATC 
CTTAAAGAAGGTGTGGGGTCTTTCCCAACCTGAGGATTTCTGAAAGGTTCACAGGTTCAATA 
TTTAATGCTTCAGAAGCATGTGAGGTTCCCAACACTGTCAGCAAAAACCTTAGGAGAAAACT 
TAAAAATATATGAATACATGCGCAATACACAGCTACAGACACACATTCTGTTGACAAGGGAA 
AACCTTCAAAGCATGTTTCTTTCCCTCACCACAACAGAACATGCAGTACTAAAGCAATATAT 
TTGTGATTCCCCATGTAATTCTTCAATGTTAAACAGTGCAGTCCTCTTTCGAAAGCTAAGAT 
GACCATGCGCCCTTTCCTCTGTACATATACCCTTAAGAACGCCCCCTCCACACACTGCCCCC 
CAGTATATGCCGCATTGTACTGCTGTGTTATATGCTATGTACATGTCAGAAACCATTAGCAT 
TGCATGCAGGTTTCATATTCTTTCTAAGATGGAAAGTAATAAAATATATTTGAAATGTAAAA 
AAAAAAAAAAA 

MSLLPRRAPPVSMRLLAAALLLLLLALYTARVDGSKCKCSRKGPKIRYSDVKKLEMKPKYPH 
CEEKMVIITTKSVSRYRGQEHCLHPKLQSTKRFIKWYNAWNEKRKVYEE 

Signal sequence: 

amino acids 1-34 

GCCCCAGGGACTGCTATGGCTTCCTTTGTTGTTCAaZCCGGTCTOCGTCATCTTl^ 

GTTAACTGCTCTTGCCaTCaAGTTCaCCCTaiTTGACaGCCaAGCa^ 

CaAAATCCGGGGCCTiU^GAACavCCXSTTACCaVATGAGATCTrGGGTCCAGTGGaGC^ 

TGCCTCACCCCCCACTGGa.GAGA.GGCGGTTTCAGCCCCaVGaACCCCCGTCCTCCTGGACri^ 

TACTCAGTTTGCTGCTGTGTGCCCCCAGaiCCTGGATGAGAGATCCTTACTGCATGACATGCTGCCCaT^ 

TACCGCCJ^TTTGGATACTTTGATGACCTATGTTCy^GATCaAAATGAAGACTGCCTTTACTTAAACATOT 

GCCCACGGAAGATGGAGCCAACACAAAGAAAAACGCAGATGATATAACGAGTAATGACCGTGGTGAAGACG^ 

TATTCATGATCAGAACAGTAAGAAGCCCGTCATGGTCTATATCCATGGGGGATCTTACATGGAGGGCACCGGCAA 

CATGATTGACGGCAGCATTTTGGCAAGCTACGGAAACGTCATCGTGATCACCATTAACTACCGTCTGGC^ 

AGGGTTTTTAAGTACCGGTGACCAGGCAGCAAAAGGCAACTATGGGCTCCTGGATCAGATTCAAGCACTGCGGTG 

GATTGAGQAGAATGTGGGAGCCTTTGGCGGGGACCCCAAGAGAGTGACCATCTTTGGCTCGGGGGCTGGGGCCTC 

CroTGTOiGCCTGTTGACCCTGTCCCACTACTCAGAAGGTCTCTTCCAGAAGGCCATCATTCAGAGCGGCACCGC 

CCTGTCCaGCTGGGa^GTGAACTACCAGCCGGCC3Ui.GTACACTCGGATATTGGC^ 

GCTOGACACXACGGACATGGTAGAATGCCTGCGQaAa^GaACTACaAGGAGCTC^^ 

GGCCACCTACCACSVTAGCCTTCGGGCCGGTGATCQACGGCQACXSTCATCCCaGACmCCCCC^ 

GCAAGGCGAGTTCCTCAACTACGACSVTCATGCrrGGGCGTCaACCaAGGGGAAGGCCTGaAGTTCGT^^ 

CGTGGATAACGAGGACGGTGTGACGCCCAACGACTTTGACTTCTCCGTGTCCAACTTCGTGGACAACCTTTACGG 

CTACCCTGAAGGGAAAGACACTTTGCGGGAGACTATCAAGTTCATGTACACAGACTGGGCCGATAAGGAAAACCC 

GGAGACGCGGCGGAAAACCCTGGTGGCTCTCTTTACTGACCACCAGTGGGTGGCCCCCGCCGTGGCCGCCGACCT 

GCACGCGCAGTACGGCTCCCCCACCTACTTCTATGCCTTCTATCATCACTGCCAaAGCGAAATGAAGCCCAGCTG 

GGCAQATTCGGCCCATGGTGATGAGGTCCCCTATGTCTTCGGCATCCCCATGATCGGTCCCACCGAGCTCTTCAG 

TTGTAACTTTTCCAAGAACGACGTCATGCTCAGCGCCGTGGTCATGACCTACTGGACGAACTTCGCCAAAACTGG 

TGATCCAAATaiACCAGTTCCTCAGGATACCa2W3TTCATTCS^aVCAaAACCC3U^CCGC^ 

GTCa^GTATAATCCCAAAQACCAGCTCrATCTOCyiTATTGGCTTGAAAC^ 

AACGAAAGTGGCTTTCTGGTTGGAACTCGTTCCTC^TTTGCaa^CTrrGAAa^^ 

AACCACAAAGGTTCCTCCACCAGACATGACATCATTTCCCTATGGCACCCGGCGATCTCCCGCCAAGATATGGCC 

AACCACXaAACGCCCAGCAATCACTCCTGCCAACAATCCCAAACACTCTAAGGACCCTCACAAAACAGGGCCTGA 

GGACACAACTCTCCTCATTGT^AACCAAACGAGATTATTCCa^CCGAATTAAGTGTCACCATTGCCGTCGGGGCGT^ 

GCTCCTCTTCCTCS^CATCTTAGCTTTTGCGGCGCTGTACTACAAAAAGGACAAGAGGCGCCATGAGACTCACAG 

GCGCCCCAGTCCCCAGAGAAACaCCACAAATGATATCGCTCACATCCAGAACGAAGAGATCATGTCTCTGCAGAT 

GAAGCAGCTGGAACACGATCACGAGTGTGAGTCGCTGCAGGCACACGACACACTGAGGCTCACCTGCCCGCCAGA 

CTACACCCTCACGCTGCGCCGGTCGCCaGATQAaiTCCCACTTATGACGCa^CACC^^ 

CAO^CTGACGGGGATGaVGCCTTTGCACa^CTTTTAAaiCCTTCAG 

CGGAO^TTCaW^aVCTAGAGTASAgCTTTGCCCTATTTCC 

AGAAGAGGGAAGGaAAGAGAGAAGGaAAGAGAGAGAGAAAGAAAGTCTCCAGACCa>GGAATGTTTTTGTCCCA 

GACTTAAGACaAAAATGCAAAAAGGCAGTCATCCCATCCCGGCAGACCCTTATCGTTGGTGTTTTCCAGTATT^^ 

AAGATC?^CTTCTGACCCTGTGAAATGTGAGAAGTACACATTTCTGTTAAAATAACTGCTTTAAGATCTCTACCA 

CTCCAATCAATGTTTAGTGTGATAGGACATCACCATTTCAAGGCCCCGGGTGTTTCCAACGTCATGGAAGCAGCT 

GACACTTCTGAAACTCAGCCAAGGACACTTGATATTTTTl^AATTACy^TGGAAGTTTARACATTTCTra 

a^CAOW^TGGATGGCTCTCCTTAAGTGAAGAAAGAGTCAATGAGATTTTGCCCaGCaCATGGl^^ 

AQAGAAGGAAACXSTAGAAATTTATTATTAAAAGAATGGACTGTGG^GCGAAATCTGTACGGTTCTGTGC^^ 

GTGTTTTGCCAGCCTGAACTATATTTAAGAGACTTTGT 



FIGURE 151 



MLNSNVLLWLTALAIKFTLIDSQAQYPWNTNYGKIRGLRTPLPNEILGPVEQYLGVPYASP 

PTGERRFQPPEPPSSWTGIRMTTQFAAVCPQHLDERSLLHDMLPIWFTAIS^ 

EDCLYLNIYVPTEDGANTKKNADDITSNDRGEDEDIHDQNSKKPVMVYIHGGSYMEGTGNMI 

DGSILASYGNVIVITINYRLGILGFLSTGDQAAKGNYGLLDQIQALRWIEENVGAFGGDPKR 

VTIFGSGAGASCVSLLTLSHYSEGLFQKAIIQSGTALSSWAVNYQPAKYTRILADKVGCNML 

DTTDMVECLRNKNYKELIQQTITPATYHIAFGPVIDGDVIPDDPQILMEQGEFLNYDIMLGV 

NQGEGLKFVDGIVDNEDGVTPNDFDFSVSNFVDNLYGYPEGKDTLRETIKFMYTDWADKENP 

ETRRKTLVALFTDHQWVAPAVAADLHAQYGSPTYFYAFYHHCQSEMKPSWADSAHGDEVPYV 

FGIPMIGPTELFSCNFSKNDVMLSAVVMTYWTNFAKTGDPNQPVPQDTKFIHTKPNRFEEVA 

WSKYNPKDQLYLHIGLKPRVRDHYRATKVAFWLELVPHLHNLNEIFQYVSTTTKVPPPDMTS 

FPYGTRRSPAKIWPTTKRPAITPANNPKHSKDPHKTGPEDTTVLIETKRDYSTELSVTIAVG 

ASLLFLNILAFAALYYKKDKRRHETHRRPSPQRNTTNDIAHIQNEEIMSLQMKQLEHDHECE 

SLQAHDTLRLTCPPDYTLTLRRSPDDIPLMTPNTITMIPNTLTGMQPLHTFNTFSGGQNSTN 

LPHGHSTTRV 

Signal sequence: 

amino acids 1-24 

Transmembrane domains: 
amino acids 189-204, 675-692 

GGGAAAGATGGCGGCGACTCTGGGACCCCTTGGGTCGTGGCAGCAGTGGCGGCGATGTTTGT 

CGGCTCGGGATGGGTCCAGGATGTTACTCCTTCTTCTTTTGTTGGGGTCTGGGCAGGGGCCA 

CAGCAAGTCGGGGCGGGTCAAACGTTCGAGTACTTGAAACGGGAGCACTCGCTGTCGAAGCC 

CTACCAGGGTGTGGGCACAGGCAGTTCCTCACTGTGGAATCTGATGGGCAATGCCATGGTGA 

TGACCCAGTATATCCGCCTTACCCCAGATATGCAl^GTAAACAGGGTGCCTTGTGGAACCGG 

GTGCCa.TGTTTCCTGAGAGACTGGGAGTTGCAGGTGa^CI^CAAAATCCATGGACAAGG 

OAAGAATCTGCATGGGGATGGCTTGGCAATCTGGTACACAAAGGATCGGATGCAGCCAGGGC 

CTGTGTTTGGAAACATGGACAAATTTGTGGGGCTGGGAGTATTTGTAGACACCTACCCCAAT 

GAGGAGAAGCAGCAAGAGCGGGTATTCCCCTACATCTCAGCCATGGTGAACAACGGCTCCCT 

CAGCTATGATCATGAGCGGGATGGGCGGCCTACAGAGCTGGGAGGCTGCACAGCCATTGTCC 

GCAATCTTCATTACGACACCTTCCTGGTGATTCGCTACGTCAAGAGGCATTTGACGATAATG 

ATGGATATTGATGGCAAGCATGAGTGGAGGGACTGCATTGAAGTGCCCGGAGTCCGCCTGCC 

CCGCGGCTACTACTTCGGCACCTCCTCCATCACTGGGGATCTCTCAGATAATCATGATGTCA 

TTTCCTTGAAGTTGTTTGAACTGACAGTGGAGAGAACCCCAGAAGAGGAAAAGCTCCATCGA 

GATGTGTTCTTGCCCTCAGTGGACAATATGAAGCTGCCTGAGATGACAGCTCCACTGCCGCC 

CCTGAGTGGCCTGGCCCTCTTCCTCATCGTCTTTTTCTCCCTGGTGTTTTCTGTATTTGCCA 

TAGTCATTGGTATCATACTCTACAACAAATGGCAGGAACAGAGCCGAAAGCGCTTCTACTGA 

GCCCTCCTGCTGCCACCACTTTTGTGACTGTCACCCATGAGGTATGGAAGGAGCAGGCACTG 

GCCTGAGCATGCAGCCTGGAGAGTGTTCTTGTCTCTAGCAGCTGGTTGGGGACTATATTCTG 

TCACTGGAGTTTTGAATGCAGGGACCCCGCATTCCCATGGTTGTGCATGGGGACATCTAACT 

CTGGTCTGGGAAGCCACCCACCCCAGGGCAATGCTGCTGTGATGTGCCTTTCCCTGCAGTCC 

TTCCATGTGGGAGCAGAGGTGTGAAGAGAATTTACGTGGTTGTGATGCCAAAATCACAGAAC 

AGAATTTCATAGCCCAGGCTGCCGTGTTGTTTGACTCAGAAGGCCCTTCTACTTCAGTTTTG 

AATCCACAAAGAATTAAAAACTGGTAACACCACAGGCTTTCTGACCATCCATTCGTTGGGTT 

TTGCATTTGACCCAACCCTCTGCCTACCTGAGGAGCTTTCTTTGGAAACCAGGATGGAAACT 

TCTTCCCTGCCTTACCTTCCTTTCACTCCATTCATTGTCCTCTCTGTGTGCAACCTGAGCTG 

GGAAAGGCATTTGGATGCCTCTCTGTTGGGGCCTGGGGCTGCAGAACACACCTGCGTTTCAC 

TGGCCTTCATTAGGTGGCCCTAGGGAGATGGCTTTCTGCTTTGGATCACTGTTCCCTAGCAT 

GGGTCTTGGGTCTATTGGCATGTCCATGGCCTTCCCAATCAAGTCTCTTCAGGCCCTCAGTG 

AAGTTTGGCTAAAGGTTGGTGTAAAAATCAAGAGAAGCCTGQAAGACATCATGGATGCCATG 

GATTAGCTGTGCAACTGACCAGCTCCAGGTTTGATCAAACCAAAAGCAAa^TTTGTCATGTG 

GTCTGACCATGTGGAGATGTTTCTGGACTTGCTAGAGCCTGCTTAGCTGCATGTTTTGTAGT 

TACGATTTTTGGAATCCCACTTTGAGTGCTGAAAGTGTAAGGAAGCTTTCTTCTTACACCTT 

TGCTGTTCTCATGTTCCAAGTCTGAGAGCAACAGACCCTCATCATCTGTGCCTGGAAGAGTT 
CACTGTCATTGAGCAGCACAGCCTGAGTGCTGGCCTCTGTCAACCCTTATTCCACTGCCTTA 
TTTGACAAGGGGTTACATGCTGCTCACCTTACTGCCCTGGGATTAAATCAGTTACAGGCCAG 
AGTCTCCTTGGAGGGCCTGGAACTCTGAGTCCTCCTATGAACCTCTGTAGCCTAAATGAAAT 
TCTTAAAATCACCGATGGAACCAAAAAAAAAAAAAAAAAGGGCGGCCGCGACTCTAGAGTCG 
ACCTGCAGTAGGGATAACAGGGTAATAAGCTTGGCCGCCATGG 

></usr/seqdb2/sst/DNA/Dnaseqs.min/ss.DNA50911 
<subunit 1 of 1, 348 aa, 1 stop 
<MW: 39711, pI: 8.70, NX(S/T): 1 

MAATLGPLGSWQQWRRCLSARDGSRMLLLLLLLGSGQGPQQVGAGQTFEYLKREHSLSKPYQ 

GVGTGSSSLWNLMGNAMVMTQYIRLTPDMQSKQGALWNRVPCFLRDWELQVHFKIHGQGKKN 

LHGDGLAIWYTKDRMQPGPVFGNMDKFVGLGVFVDTYPNEEKQQERVFPYISAMVNNGSLSY 

DHERDGRPTELGGCTAIVRNLHYDTFLVIRYVKRHLTIMMDIDGKHEWRDCIEVPGVRLPRG 

YYFGTSSITGDLSDNHDVISLKLFELTVERTPEEEKLHRDVFLPSVDNMKLPEMTAPLPPLS 

GLALFLIVFFSLVFSVFAIVIGIILYNKWQEQSRKRFY 

Signal sequence: 

amino acids 1-38 

Transmembrane domain: 

amino acids 310-329 

CCG3VC3CCX3GGCGCGCAGCGACGGAGCTGGGGCCGGCCTGGGACCaTGGGCGTGAGTGCAATCTA^ 

CTGATGGTGGGTCGTTAACCTCAGTGGGGACTCCaAGATTTCCATGAAGaAAATC^ 

TGGGGTCTGGCTCaGAATTCCTGCaGCTGGTGJUiAATCTGTTTTCrAGaAGAGGTTT^^ 

GACaTGTTCCCGATTTQ?VGGTGaaACa^TGAJU3AGAa?iATAGaATACT 

TGCTGCTGCTGGCCCTGGCTGCGCTGCTGGCCTTTGTGAGCCTCAGCCTGCAGTTOTTCC^ 

CGACTCCTAAGAATGGAATGAGTAGCAAGAGTCGAAAGAGAATCATGCCCGACCCTGTGACGGAGCCCCCTGTGA 

CAGACCCCGTTTATGAAGCTCTTTTGTACTGCAACATCCCCyiGTGTGGCCGAGCGCAGCATGQAAGGTCA 

CGCATCATTTTAAGCTGGTCTCAGTGCATGTGTTCATTCGCCACGGAGACAGGTACCCACTGTATGTCATTCCC^ 

AAACAAAGCGACCAGAAATTGACTGCACTCTGGTGGCTAACAGGAAACCGTATCACCCAA2^CTGG^ 

TTAGTCACSlTGTCAAaAGGATCCGGAGCCTCTTTCGAAAGCCCCTTGAACTCCTTGCCTCTTTACCCAAATCACC 

CaTTGTGTGAGATGGGAGAGCTCmc:aCAGACAGGAGTTGTGCAGCATTTGCAGAACGGTCa.GCTGCT^ 

TCTATCTAAAGAAACACAAACTCCTGCCCAATGATTGGTCTGCAGACCAGCTCTAT^ 

GCCGGACCCTAOiJ^GTGGGCTGGCCTTGCTTTATGGCTTTCTCCCAGATTTTGACT^^ 

GGCACCaGCCAAGTGCGCTGTTCrGCTCrGGAaGCTGCTATTGC^ 

AGCJGTCGTCAGTACCTCCTACGTTTGAAAAACaGCCaGCTGGAGAAGACOTAC^^ 

ATGTCCCCACCAAGCAGCTTAGaGCTGCCAACCCCATAGACTCCa^TGCTCTGCCACTTCTGCCACSUVTG 

TTCCCTGTACCy^GAAATGGCTGTGTTGACATGGAG(::a.CTTCAAGGTAATTAAGACCCATCAGATCGAGGATG^ 

GGGAAAGACGGGAGAAGAAATTGTACTTCGGGTATTCTCTCCTGGGTGCCGACCCCATCCTGAACCAAACCATCG 

GCCGGATGCAGCGTGCCACCGAGGGCAGGAAAGAAQAGCTCTTTGCCCTCTACTCTGCTCATGATGTCACTCTGT 

CACCAGTTCTCAGTGCCTTGGGCCTTTCAGAAGCCaGGTTCCa^GGTTTGCaGCCAGGTTGATCTTT^ 

GGCAAGACAGAGAAAAGCCCAGTGAACa^TTCCGTCCGGATTCTTTACaATGGCGTCGATGTCACATTCCACA 

CTTTCTGCCa^GACO^CrACAAGCGTTCTCCCa^GCCCSVTGTGCCCG^ 

GGGACATGTTTGTAGCCCTGGCTGGCAGTGGTACSiAATTATTATGATGCATGTaVCA 

TATGOMSTACAGaVGTATAGAATCa^TGCCaATACaGaGCaTAGGGRAAGCT 

TAAGGGTAGAAGATTATTGCTTTTTAAAGGCTAAATATTGTTTGTGGGAACCaaVfiATGG 

AAGCACATTGCTGCAATGTGGTACGTGAATTGCTTGGTACaaAATGGCCAGTTCAai^^ 

TATOlTAGCCAQACTTCGCTTAGAATGCCAGAATAATATAGTTCaAGACCTGAlWSTO 

TCTTCTGGCCTGCCCCATGTTACTATGTGATGGaACa^GCACaCCTCAACCa^AAATTTTTTTAATCTTAGA<^ 

TTTACCTTGTCCTTGTTAAGAATTTCTTGAAGTGATTTATCTAAAATAAAGGTTGGCAAACTTlTTCrGTAA^ 

GCCAGATTGTAAATATTTCAGACTGTGTGGACCAAAAGGCCACATACAGTCTCTGTaVTAACTACTa^ 

TTCTGAAGCaiGGAAAGCCACCACAQACAGTACATAAAGGAATATGTGTAGCTGGGTTCCCAGGCCAGACAAAACA 

GATGGTGACCy^GACTTGGCCCCTGGGCTGTAGTTTGCTGACCCCTCATCTAAAAAATAGGCTATACTAa^TTGC 

ACTTCXS^GCACTTTGAGaACraAGTTGAATACCaUiGAATTATTa^TGGTTCCT 

CAGAATTTGGTCraTATCrGACACTAGAACaaAACTTGAGGGTAAATA 

AACTGATTAGAAGAATACTTGATGTTTATGATGATTGTGGTACaiAGATAGTTTTAAGTATGTTC^^ 

CTGCTGTAGTCTATTTGCTGTATATGCTGAAATTTTTGTATGCCATTTAGTATTTTTATAGTTTAGGAAAATAOT 

TTCTAAGACCAGTTTTAGATGACTCTTATTCCTGTAGTAATATTCAATTTGCTGTACCTGCr^ 

GAGGCTAGAAGATGAATTCAGGCACTTTCTTCCAATAAAACTAATTATGGCTCATTCCCTTTGAa^GCTGT^^ 

ACTGGATTCATTTTTAAACCATTTTCATCAGTTTCAAATGGTAAATTCTGATTGATTTTTAAATGCGTTTTTGGA 

AGAACTTTGCTATTAGGTAGTTTACAGATCTTTATAAGGTGTTTTATATATTAGAAGCAATTATAATTACATCTG 

TGATTTCTGAACTAATGGTGCTAATTCAGAGAAATGGAAAGTGAAAGTGAGATTCTCTGTTGTCATCGGCATTCC 

AACTlTTTCTCTTTGTTTTTGTCCAGTGTTGCaTTTGAATATGTCTGTTTCTATAAATAAATTTTTT^ 

></usr/seqdb2/sst/DNA/Dnaseqs.min/ss.DNA48329 
<subunit 1 of 1, 480 aa, 1 stop 
<MW: 55240, pI: 9.30, NX(S/T): 2 

MLFRiniFLLLLMAALLAFVSLSLQFFHLIPVSTPKNGMSSKSRKRIMPDPVTEPPVTDPV^ 
EALLYCNIPSVAERSMEGHAPHHFKLVSVHVFIRHGDRYPLYVIPKTKRPEIDCTLVANRKP 
YHPKLEAFISHMSKGSGASFESPLNSLPLYPNHPLCEMGELTQTGVVQHLQNGQLLRDIYLK 
KHKLLPNDWSADQLYLETTGKSRTLQSGLALLYGFLPDFDWKKIYFRHQPSALFCSGSCYCP 
VRNQYLEKEQRRQYLLRLKNSQLEKTYGEMAKIVDVPTKQLRAANPIDSMLCHFCHNVSFPC 
TRNGCVDMEHFKVIKTHQIEDERERREKKLYFGYSLLGAHPILNQTIGRMQRATEGRKEELF 
ALYSAHDVTLSPVLSALGLSEARFPRFAARLIFELWQDREKPSEHSVRILYNGVDVTFHTSF 
CQDHHKRSPKPMCPLENIAmFVKRDMFVALGGSGTNYYDACHREGF 

Signal sequence: 

amino acids 1-18 

AAAaAAGCTCacrAAaGTTTCTATTAGAGCGAATAaaSTi^ 
GOTATTTi^GAGATJWU^CXSaAATATCCOTTCTGGG^ 

GCCGCTGTTCACCaATCGGGGAGaGaaAAGCGGaGATCCrGCrCGCCTTGCACG^ 

AGCTAGGAATQAACCATCCCTGGGAGTATGTGGAAACaACGGAGQAGCTCTGACTTCCCaACTGTCCC^ 

GGGCGAAGGAACTGCTCCK3ACTTaVCTGGTTAA(3GGCaGAATTGAA 

TCCTGCGCGACTGCACCGGGACTACAAAGGGCTTGTCCTGCTGGGAATCCTCCTGGGGACTCTGTGGGAGACXX36 

ATGCACCCAGATACGOTATTCAGTTCCGGAAGAGCTGGAGAAAGGCTCTAGGGTGGGCGACa.TCTCa^GGGACCT 

GGGGCTGGAGCCCCGGGAGCTCGCGGAGCGCGGAGTCCGGATCATCCCCAGAGGTAGGACGCAGCTTTTCGCCCT 

QAATCCGCGCAGCGGCAGCTTGGTCaCGGCGGGCAGGATAGACCGGGAGGAGCTCTGTATGGGGGCCaVTCaAGT^ 

TCAATTAAATCTAGACATTCTGATGGAGGATAAAGTGAAAATATATGGAGTAGAAGTAGAAGTAAGG^ 

CGACAATGCGCCTTACTTTCGTGAAAGTGAATTAGAAATAAAAATTAGTGAAAATGCAGCCACTGAGATGCGGTT 

CCCTCTaCCCCACGCCTGGGATCCGGATATCGGGAAGAACTCTCTGCAGAGCTACGAGCTCAGCCCGAACACTC^ 

CTTCTCCCTCATCGTGaiAAATGGAGCCGACGGTAGTAAGTACCCa3AATTGGT^ 

CGAAGAAAAGGCTGCTCaCCACCTGGTCCTTACXSGCCTCC^ 

CaTCCGCGTGATGGTTCTCG&TGCGAACGACaACGCaCCAGCGTTTGCTa^^^ 

GGAGAATCTGGCCTTGGGCACGaVGCTGCTTGTAGTCaACGCTACCGACCCTGACGAAGGAGTCAATGCG<^ 

GAGGTATTCCTTCCGGTATGTGQACGACAAGGCGGCCGAAGTTTTCAAACTAGATTGTAATTCAGGGACy^TATC 

AACaATAGGGGAGTTGGACCACGAGGAGTCAGGATTCTACCAGATGGAAGTGCa^GCAATGGAT^ 

TTCTGCGCGAGCCAAAGTCCTGATCACrGTTCTGGACGTQAACGACAATGCCCCAGAAGTGGTCCTai 

OSCCAGCTCGGTTCCCGAAAACTCTCCCAGAGGGACATTAATTGCCCTTTTAAATGTAAATGACCAAGATTCTGA 

GGAAAACGGACAGGTGATCTGTTTCATCCAAGGAAATCTGCCCTTTAAATTAGAAARATCTTACGGAAATTACTA 

TAGTTTAGTCACAGACATAGTCTTGGATAGGGAACAGGTTCCTAGCTACAACATCa^CAGT^ 

GGGAACCCOK^CCCTATCCACGGaAACTCATATCTCGCTGAACGTGGCaGACACa^ 

CCCTCaGGCCTCCTATTCXXSCTTATATCCCAGAGAAa^TCCCa^ 

CXSACCCCGACTOTGAAGAGAACGCCCAGATCACTTATTCCCrGGCTGAQAACacaVTCCS^ 

GTCOTACGTGTCCATCAACTCCGACACTGGGGTACTGTATGCGCTGAGCTCCTTCGACTACGAGCAGTTCa^ 

CTTGCAAGTGAAAGTGATGGCGCGGGAG^ACGGGCACCCGCCCCTCAGCAGCAACGTGTCXSTTGAGCCrGTTC^ 

GCTGQACOIGAACGACT^TGCGCCCGAGATCCTGTACCCCGCCCTCCCCACGGACGGTTCCACTGGCGTGQAGCT 

GGCTCCCCGCTCCGCAGAGCCCGGCTACCTGGTGACOiAGGTGGTGGCGGTGGACAGAGACTCCGGCCAGAACGC 

CTGGCTGTCCTACCGTCTGCTCAAGGCCAGCGAGCCGGGACTCTTCTCGGTGGGTCTGCACACGGGCGAGGTGaS 

CACGGCGCGAGCCCTGCTGGACAGAGACGCGCTCAAGCAGAGCCTCGTAGTGGCCGTCCAGGACCACGGCCAGOT 

CCCTCTCTCCGCOiCTGTCACGCTCACCGTGGCCGTGGCCGACAGCATCCCCaUlGTCCTGGaSGACCT 

CCTCGAGTCTCCAGCTAACTCIX3A7^CCTaVGACCTCACrrCTGTACCT 

CGTCTTCCTGGCCTTCGTCATCnTOCrGCrGGCGCT 

TTCyiGGAGGCGGCTTGACAGGAGa3CCGGCGTa3CACTTTGTGGGOTTGGACGGGGTGCa<^^ 

CTATTCCCACGAGGTTTCCCTCACCACGGACTCGCGGAAGAGTGACCTGATCTTCCCCCAGCCCAACTATGCAGA 

CATGCTCGTCAGCCAGGAGAGCTTTGAAAAAAGCGAGCCCCTTTTGCTGTCAGGTGATTCGGTATTTTCTAAAGA 

CaiGTa^TGGGTTAATTGAGGTGAGTTTATATCAAATCTTCTTTCTTTTTTTTTTTAATTGCTCTGTCTCCCAAGC 

TGGAGTGCAGCGGTACGATCATAGCTCTICTGCGGCCTCAAACTCCTAGGCTCAAGCAATTATCCCACCTTTGCCT 

CCGGTGTAACAGGGACTACAGGTGCAAGCCACCTACTGTCTGCCTATCTATCTATCTATCTATCTATCTATCTAT 

OGftGGCGGGTGGATCa^CCTGAGGTTGGGAGTTTGAGACCAGCCTGACCaACATGGAGAAACCCCGTCTATAC^^ 

AAAAATACAAAATTAGCOSGGCraTGGTGGTGCATGTCTGTAATCCCaGCTACTTGGG^ 

TGCTTTAACCT^SGGAGGTGQAGGTTGCAATGaGCTGaGATTGTGCCa 

AAACTCTATCTCA 



FIGURE 157 



</usr/seqdb2/sst/DNA/Dnaseqs.min/ss.DNA48306 
<subunit 1 of 1, 916 aa, 1 stop 
<MW: 100204, pI: 4.92, NX(S/T): 4 

MIPAEILHRDYKGLVLLGILLGTLWETGCTQIRYSVPEELEKGSRVGDISRDLGLEPREIAER 
GVRIIPRGRTQLFALNPRSGSLVTAGRIDREELCMGAIKCQLNLDILMEDKVKIYGVEVEVR 
DINDNAPYFRESELEIKISENAATEMRFPLPHAWDPDIGKNSLQSYELSPNTHFSLIVQNGA 
DGSKYPELVLKRALDREEKAAHHLVLTASDGGDPVRTGTARIRVMVLDANDNAPAFAQPEYR 
ASVPENLALGTQLLVVNATDPDEGVNAEVRYSFRYVDDKAAQVFKLDCNSGTISTIGELDHE 
ESGFYQMEVQAMDNAGYSARAKVLITVLDVNDNAPEVVLTSIASSVPENSPRGTLIALLNVN 
DQDSEENGQVICFIQGNLPFKLEKSYGNYYSLVTDIVLDREQVPSYNITVTATDRGTPPLST 
ETHISLNVADTNDNPPVFPQASYSAYIPENNPRGVSLVSVTAHDPDCEENAQITYSLAENTI 
QGASLSSYVSINSDTGVLYALSSFDYEQFRDLQVKVMARDNGHPPLSSNVSLSLFVLDQNDN 
APEILYPALPTDGSTGVELAPRSAEPGYLVTKVVAVDRDSGQNAWLSYRLLKASEPGLFSVG 
LHTGEVRTARALLDRDALKQSLVVAVQDHGQPPLSATVTLTVAVADSIPQVLADLGSLESPA 
NSETSDLTLYLVVAVAAVSCVFIAFVILLLALRLRRWHKSRLLQASGGGLTGAPASHFVGVD 
GVQAFLQTYSHEVSLTTDSRKSHLIFPQPNYADMLVSQESFEKSEPLLLSGDSVFSKDSHGL 
IEVSLYQIFFLFFFNCSVSQAGVQRYDHSSLRPQTPRLKQLSHLCLRCNRDYRCKPPTVCLS 
IYLSIYLSIYLSIYLLLSCTDGSLTPVIPVliWElUEAGGSPEVGSLRPA 

Signal sequence: 
amino acids 1-30 

Transmembrane domains: 

amino acids 693-711, 809-823, 869-888 



FIGURE 158 



CCCAGGCTCTAGTGCAGGAGGAGAAGGAGGAGGAGCAGGAGGTGGAGATTCCCAGTTiy^AAG 
GCTCCAGAATCGTGTACCAGGCAGAGAACTGAAGTACTGGGGCCTCCTCCACTGGGTCCGAA 
TCAGTAGGTGACCCCGCCGCTGGATTCTGGAAGACCTCACCATGGGACGCCCCCGACCTCGT 
GCGGCCAAGACGTGGATGTTCCTGCTCTTGCTGGGGGGAGCCTGGGCAGGACACTCCAGGGC 
ACAGGAGGACAAGGTGCTGGGGGGTCATGAGTGCCAACCCCATTCGCAGCCTTGGCAGGCGG 
CCTTGTTCCAGGGCCAGCAACTACTCTGTGGCGGTGTCCTTGTAGGTGGCAACTGGGTCCTT 
ACAGCTGCCCACTGTAAAAAACCGAAATACACAGTACGCCTGGGAGACCACAGCCTACAGAA 
TAAAGATGGCCCAGAGCAAGAAATACCTGTGGTTCAGTCCATCCCACACCCCTGCTACAACA 
GCAGCGATGTGGAGGACCACAACCATGATCTGATGCTTCTTCAACTGCGTGACCAGGCATCC 
CTGGGGTCCAAAGTGAAGCCCATCAGCCTGGCAGATCATTGCACCCAGCCTGGCCAGAAGTG 
CACCGTCTCAGGCTGGGGCACTGTCACCAGTCCCCGAGAGAATTTTCCTGACACTCTCAACT 
GTGCAGAAGTAAAAATCTTTCCCCAGAAGAAGTGTGAGGATGCTTACCCGGGGCAGATCACA 
GATGGCATGGTCTGTGCAGGCAGCAGCAAAGGGGCTGACACGTGCCAGGGCGATTCTGGAGG 
CCCCCTGGTGTGTGATGGTGCACTCCAGGGCATCACATCCTGGGGCTCAGACCCCTGTGGGA 
GGTCCGACAAACCTGGCGTCTATACCAACATCTGCCGCTACCTGGACTGGATCAAGAAGATC 
ATAGGCAGCAAGGGCTGATTCTAGGATAAGCACTAGATCTCCCTTAATAAACTCACAACTCT 
CTGGTTC 






FIGURE 159 



</usr/seqdb2/sst/DNA/Dnaseqs.min/ss.DNA48336 
<subunit 1 of 1, 260 aa, 1 stop 
<MW: 28048, pI: 7.87, NX(S/T): 1 

MGRPRPRAAKTWMFLLLLGGAWAGHSRAQEDKVLGGHECQPHSQPWQAALFQGQQLLCGGVL 
VGGNWVLTAAHCKKPKYTVRLGDHSLQNKDGPEQEIPVVQSIPHPCYNSSDVEDHNHDLMLL 
QLRDQASLGSKVKPISLADHCTQPGQKCTVSGWGTVTSPRENFPDTLNCAEVKIFPQKKCED 
AYPGQITDGMVCAGSSKGADTCQGDSGGPLVCDGALQGITSWGSDPCGRSDKPGVYTNICRY 
LDWIKKIIGSKG 

Important Features: 
Signal peptide: 
amino acids 1-23 

Transmembrane domain: 
amino acids 51-71 

N-glycosylation site. 

amino acids 110-113 

Serine proteases, trypsin family, histidine active site, 
amino acids 69-74 and 207-217 

Tyrosine kinase phosphorylation site, 
amino acids 182-188 

Kringle domain proteins motif 

amino acids 205-217 
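
The relationship between the nucleotide listing of FIGURE 158 and the protein listing of FIGURE 159 can be illustrated with a minimal translation sketch. It assumes the standard genetic code and, as a simplification, treats the first ATG in the input as the start codon. The fragment is copied from the FIGURE 158 listing around its initiating ATG, and the helper names are hypothetical.

```python
# Standard genetic code, built from the conventional TCAG ordering.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna):
    """Translate from the first ATG until a stop codon or the end of input."""
    dna = dna.upper()
    start = dna.find("ATG")  # simplification: first ATG is taken as the start
    if start < 0:
        return ""
    protein = []
    for i in range(start, len(dna) - 2, 3):
        aa = CODON_TABLE.get(dna[i:i + 3], "X")  # "X" marks unreadable codons
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# Fragment of the FIGURE 158 nucleotide listing spanning the start codon.
FRAGMENT = "CACCATGGGACGCCCCCGACCTCGTGCGGCCAAGACGTGGATGTTC"

if __name__ == "__main__":
    print(translate(FRAGMENT))
```

For this fragment the sketch yields `MGRPRPRAAKTWMF`, the N-terminal residues of the 260-residue protein. Codons containing ambiguous bases such as N are reported as `X` rather than guessed.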



FIGURE 160 

GGCGCCGGTGCACCGGGCGGGCTGAGCGCCTCCTGCGGCCCGGCCTGCGCGCCCCGGCCCGC 
CGCGCCGCCCACGCCCCAACCCCGGCCCGCGCCCCCTAGCCCCCGCCCGGGCCCGCGCCCGC 
GCCCGCGCCCAGGTGAGCGCTCCGCCCGCCGCGAGGCCCCGCCCCGGCCCGCCCCCGCCCCG 
CCCCGGCCGGCGGGGGAACCGGGCGGATTCCTCGCGCGTCAAACCACCTGATCCCATAAl^C 
ATTCATCCTCCCGGCGGCCCGCGCTGCGAGCGCCCCGCCAGTCCGCGCCGCCGCCGCCCTCG 
CCCTGTGCGCCCTGCGCGCCCTGCGCACCC6CGGCCCGAGCCCAGCCAGAGCCGGGCGGAGC 
GGAGCGCGCCGAGCCTCGTCCCGCGGCCGGGCCGGGGCCGGGCCGTAGCGGCGGCGCCTGGA 
TGCGGACCCGGCCGCGGGGAGACGGGCGCCCGCCCCGAAACGACTTTCAGTCCCCGACGCGC 
CCCGCCCAACCCCTACGATGAAGAGGGCGTCCGCTGGAGGGAGCCGGCTGCTGGCATGGGTG 
CTGTGGCTGCAGGCCTGGCAGGTGGCAGCCCCATGCCCAGGTGCCTGCGTATGCTACAATGA 
GCCCAAGGTGACGACAAGCTGCCCCCAGCAGGGCCTGCAGGCTGTGCCCGTGGGCATCCCTG 
CTGCCAGCCAGCGCATCTTCCTGCACGGCAACCGCATCTCGCATGTGCCAGCTGCCAGCTTC 
CGTGCCTGCCGCAACCTCACCATCCTGTGGCTGCACTCGAATGTGCTGGCCCGAATTGATGC 
GGCTGCCTTCACTGGCCTGGCCCTCCTGGAGCAGCTGGACCTCAGCGATAATGCACAGCTCC 
GGTCTGTGGACCCTGCCACATTCCACGGCCTGGGCCGCCTACACACGCTGCACCTGGACCGC 
TGCGGCCTGCAGGAGCTGGGCCCGGGGCTGTTCCGCGGCCTGGCTGCCCTGCAGTACCTCTA 
CCTGCAGGACAACGCGCTGCAGGCACTGCCTGATGACACCTTCCGCGACCTGGGCAACCTCA 
CACACCTCTTCCTGCACGGCAACCGCATCTCCAGCGTGCCCGAGCGCGCCTTCCGTGGGCTG 
CACAGCCTCGACCGTCTCCTACTGCACCAGAACCGCGTGGCCCATGTGCACCCGCATGCCTT 
CCGTGACCTTGGCCGCCTCATGACACTCTATCTGTTTGCCAACAATCTATCAGCGCTGCCCA 
CTGAGGCCCTGGCCCCCCTGCGTGCCCTGCAGTACCTGAGGCTCAACGACAACCCCTGGGTG 
TGTGACTGCCGGGCACGCCCACTCTGGGCCTGGCTGCAGAAGTTCCGCGGCTCCTCCTCCGA 
GGTGCCCTGCAGCCTCCCGCAACGCCTGGCTGGCCGTGACCTCAAACGCCTAGCTGCCAATG 
ACCTGCAGGGCTGCGCTGTGGCCACCGGCCCTTACCATCCCATCTGGACCGGCAGGGCCACC 
GATGAGGAGCCGCTGGGGCTTCCCAAGTGCTGCCAGCCAGATGCCGCTGACAAGGCCTCAGT 
ACTGGAGCCTGGAAGACCAGCTTCGGCAGGCAATGCGCTGAAGGGACGCGTGCCGCCCGGTG 
ACAGCCCGCCGGGCAACGGCTCTGGCCCACGGCACATCAATGACTCACCCTTTGGGACTCTG 
CCTGGCTCTGCTGAGCCCCCGCTCACTGCAGTGCGGCCCGAGGGCTCCGAGCCACCAGGGTT 
CCCCACCTCGGGCCCTCGCCGGAGGCCAGGCTGTTCACGCAAGAACCGCACCCGCAGCCACT 
GCCGTCTGGGCCAGGCAGGCAGCGGGGGTGGCGGGACTGGTGACTCAGAAGGCTCAGGTGCC 
CTACCCAGCCTCACCTGCAGCCTCACCCCCCTGGGCCTGGCGCTGGTGCTGTGGACAGTGCT 
TGGGCCCTGCTGACCCCCAGCGGACACAAGAGCGTGCTCAGCAGCCAGGTGTGTGTACATAC 
GGGGTCTCTCTCCACGCCGCCAAGCCAGCCGGGCGGCCGACCCGTGGGGCAGGCCAGGCCAG 
GTCCTCCCTGATGGACGCCTGCCGCCCGCCACCCCCATCTCCACCCCATCATGTTTACAGGG 
TTCGGCGGCAGCGTTTGTTCCAGAACGCCGCCTCCCACCCAGATCGCGGTATATAGAGATAT 
GCATTTTATTTTACTTGTGTAAAAATATCGGACGACGTGGAATAAAGAGCTCTTTTCTTAAA 



FIGURE 161 



></usr/seqdb2/sst/DNA/Dnaseqs.min/ss.DNA44184 
<subunit 1 of 1, 473 aa, 1 stop 
<MW: 50708, pI: 9.28, NX(S/T): 6 

MKRASAGGSRLLAWVLWLQAWQVAAPCPGACVCYNEPKVTTSCPQQGLQAVPVGIPAASQRI 
FLHGNRISHVPAASFRACRNLTILWLHSNVLARIDAAAFTGLALLEQLDLSDNAQLRSVDPA 
TFHGLGRLHTLHLDRCGLQELGPGLFRGLAALQYLYLQDNALQALPDDTFRDLGNLTHLFLH 
GNRISSVPERAFRGLHSLDRLLLHQNRVAHVHPHAFRDLGRLMTLYLFANNLSALPTEALAP 
LRALQYLRLNDNPWVCDCRARPLWAWLQKFRGSSSEVPCSLPQRLAGRDLKRLAANDLQGCA 
VATGPYHPIWTGRATDEEPLGLPKCCQPDAADKASVLEPGRPASAGNALKGRVPPGDSPPGN 
GSGPRHINDSPFGTLPGSAEPPLTAVRPEGSEPPGFPTSGPRRRPGCSRKNRTRSHCRLGQA 
GSGGGGTGDSEGSGALPSLTCSLTPLGLALVLWTVLGPC 

Important features: 
Signal peptide: 

amino acids 1-26 

Leucine zipper pattern. 

amino acids 135-156 

Glycosaminoglycan attachment site, 
amino acids 436-439 

N-glycosylation site. 

amino acids 82-85, 179-182, 237-240, 372-375 and 423-426 

VWFC domain 

amino acids 411-425 



FIGURE 162 



GGAAGTCCACGGGGAGCTTGGATGCCAAAGGGAGGACGGCTGGGTCCTCTGGAGAGGACTAC 
TCACTGGCATATTTCTGAGGTATCTGTAGAATAACCACAGCCTCAGATACTGGGGACTTTAC 
AGTCCCACAGAACCGTCCTCCCAGGAAGCTGAATCCAGCAAGAACAATGGAGGCCAGCGGGA 
AGCTCATTTGCAGACAAAGGCAAGTCCTTTTTTCCTTTCTCCTTTTGGGCTTATCTCTGGCG 
GGCGCGGCGGAACCTAGAAGCTATTCTGTGGTGGAGGAAACTGAGGGCAGCTCCTTTGTCAC 
CAATTTAGCAAAGGACCTGGGTCTGGAGCAGAGGGAATTCTCCAGGCGGGGGGTTAGGGTTG 
TTTCCAGAGGGAACAAACTACATTTGCAGCTCAATCAGGAGACCGCGGATTTGTTGCTAAAT 
GAGAAATTGGACCGTGAGGATCTGTGCGGTCACACAGAGCCCTGTGTGCTACGTTTCCAAGT 
GTTGCTAGAGAGTCCCTTCGAGTTTTTTCAAGCTGAGCTGCAAGTAATAGACATAAACGACC 
ACTCTCCAGTATTTCTGGACAAACAAATGTTGGTGAAAGTATCAGAGAGCAGTCCTCCTGGG 
ACTACGTTTCCTCTGAAGAATGCCGAAGACTTAGATGTAGGCCAAAACAATATTGAGAACTA 
TATAATCAGCCCCAACTCCTATTTTCGGGTCCTCACCCGCAAACGCAGTGATGGCAGGAAAT 
ACCCAGAGCTGGTGCTGGACAAAGCGCTGGACCGAGAGGAAGAAGCTGAGCTCAGGTTAACA 
CTCACAGCACTGGATGGTGGCTCTCCGCCCa.GATCTGGCACTGCrCAGGTCTACATCGAAGT 
CCTGGATGTCAACGATAATGCCCCTGAATTTGAGCAGCCTTTCTATAGAGTGCAGATCTCTG 
AGGACAGTCCGGTAGGCTTCCTGGTTGTGAAGGTCTCTGCCACGGATGTAGACACAGGAGTC 
AACGGAGAGATTTCCTATTCACTTTTCG?iAGCTTCAGAAGAGATTGGCAAAACCTTTAAGAT 
CAATCCCTTGACAGGAGAAATTGAACTAAAAAAACAACTCGATTTCGAAAAACTTCAGTCCT 
ATGAAGTCAATATTGAGGCAAGAGATGCTGGAACCTTTTCTGGAAAATGCACCGTTCTGATT 
CAAGTGATAGATGTGAACGACCATGCCCCAGAAGTTACCATGTCTGCATTTACCAGCCCAAT 
ACCTGAGAACGCGCCTGAAACTGTGGTTGCACTTTTCAGTGTTTCAGATCTTGATTCAGGAG 
AAAATGGGAAAATTAGTTGCTCCATTCAGGAGGATCTACCCTTCCTCCTGAAATCCGCGGAA 
AACTTTTACACCCTACTAACGGAGAGACCACTAGACAGAGAAAGCAGAGCGGAATACAACAT 
CACTATCACTGTCACTGACTTGGGGACCCCTATGCTGATAACACAGCTCAATATGACCGTGC 
TGATCGCCGATGTCAATGACAACGCTCCCGCCTTCACCCAAACCTCCTACACCCTGTTCGTC 
CGCGAGAACAACAGCCCCGCCCTGCACATCCGCAGCGTCAGCGCTACAGACAGAGACTCAGG 
CACCAACGCCCAGGTCACCTACTCGCTGCTGCCGCCCCAGGACCCGCACCTGCCCCTCACAT 
CCCTGGTCTCCATCAACGCGGACAACGGCCACCTGTTCGCCCTCAGGTCTCTGGACTACGAG 
GCCCTGCAGGGGTTCCAGTTCCGCGTGGGCGCTTCAGACCACGGCTCCCCGGCGCTGAGCAG 
CGAGGCGCTGGTGCGCGTGGTGGTGCTGGACGCCAACGACAACTCGCCCTTCGTGCTGTACC 
CGCTGCAGAACGGCTCCGCGCCCTGCACCGAGCTGGTGCCCCGGGCGGCCGAGCCGGGCTAC 
CTGGTGACCAAGGTGGTGGCGGTGGACGGCGACTCGGGCCAGAACGCCTGGCTGTCGTACCA 
GCTGCTCAAGGCCACGGAGCTCGGTCTGTTCGGCGTGTGGGCGCACAATGGCGAGGTGCGCA 
CCGCCAGGCTGCTGAGCGAGCGCGACGCGGCCAAGCACAGGCTGGTGGTGCTGGTCAAGGAC 
AATGGCGAGCCTCCGCGCTCGGCCACCGCCACGCTGCACGTGCTCCTGGTGGACGGCTTCTC 
CCAGCCCTACCTGCCTCTCCCGGAGGCGGCCCCGACCCAGGCCCAGGCCGACTTGCTCACCG 
TCTACCTGGTGGTGGCGTTGGCCTCGGTGTCTTCGCTCTTCCTCTTTTCGGTGCTCCTGTTC 
GTGGCGGTGCGGCTGTGTAGGAGGAGCAGGGCGGCCTCGGTGGGTCGCTGCTTGGTGCCCGA 
GGGCCCCCTTCCAGGGCATCTTGTGGACATGAGCGGCACCAGGACCCTATCCCAGAGCTACC 
AGTATGAGGTGTGTCTGGCAGGAGGCTCAGGGACCAATGAGTTCAAGTTCCTGAAGCCGATT 
ATCCCCAACTTCCCTCCCCAGTGCCCTGGGAAAGAAATACAAGGAAATTCTACCTTCCCCAA 
TAACTTTGGGTTCAATATTCAGTGACCATAGTTGACTTTTACATTCCATAGGTATTTTATTT 
TGTGGCATTTCCATGCCAATGTTTATTTCCCCCAATTTGTGTGTATGTAATATTGTACGGAT 
TTACTCTTGATTTTTCTCATGTTCTTTCTCCCTTTGTTTTAAAGTGAACATTTACCTTTATT 
CCTGGTTCTT 



FIGURE 163 



</usr/seqdb2/sst/DNA/Dnaseqs.min/ss.DNA4S314 
<subunit 1 of 1, 798 aa, 1 stop 
<MW: 87552, pI: 4.84, NX(S/T): 5 

MEASGKLICRQRQVLFSFLLLGLSLAGAAEPRSYSVVEETEGSSFVTNLAKDLGLEQREFSR 
RGVRVVSRGNKLHLQLNQETADLLLNEKLDREDLCGHTEPCVLRFQVLLESPFEFFQAELQV 
IDINDHSPVFLDKQMLVKVSESSPPGTTFPLKNAEDLDVGQNNIENYIISPNSYFRVLTRKR 
SDGRKYPELVLDKALDREEEAELRLTLTALDGGSPPRSGTAQVYIEVLDVNDNAPEFEQPFY 
RVQISEDSPVGFLVVKVSATDVDTGVNGEISYSLFQASEEIGKTFKINPLTGEIELKKQLDF 
EKLQSYEVNIEARDAGTFSGKCTVLIQVIDVNDHAPEVTMSAFTSPIPENAPETVVALFSVS 
DLDSGENGKISCSIQEDLPFLLKSAENFYTLLTERPLDRESRAEYNITITVTDLGTPMLITQ 
LNMTVLIADVNDNAPAFTQTSYTLFVRENNSPALHIRSVSATDRDSGTNAQVTYSLLPPQDP 
HLPLTSLVSINADNGHLFALRSLDYEALQGFQFRVGASDHGSPALSSEALVRVVVLDANDNS 
PFVLYPLQNGSAPCTELVPRAAEPGYLVTKVVAVDGDSGQNAWLSYQLLKATELGLFGVWAH 
NGEVRTARLLSERDAAKHRLVVLVKDNGEPPRSATATLHVLLVDGFSQPYLPLPEAAPTQAQ 
ADLLTVYLVVALASVSSLFLFSVLLFVAVRLCRRSRAASVGRCLVPEGPLPGHLVDMSGTRT 
LSQSYQYEVCLAGGSGTNEFKFLKPIIPNFPPQCPGKEIQGNSTFPNNFGFNIQ 

Important features: 
Signal peptide: 

amino acids 1-26 

Transmembrane domain: 
amino acids 685-712 

Cadherins extracellular repeated domain signature. 

amino acids 122-132, 231-241, 336-346, 439-449 and 549-559 

ATP/GTP -binding site motif A (P-loop) . 

amino acids 285-292 

N-glycosylation site. 

amino acids 418-421, 436-439, 567-570 and 786-789 



FIGURE 164 



ACCCACGCGTCCGCCCACGCGTCCGCCCACGCGTCCGCCCACGCGTCCGCGCGTAGCCGTGC 
GCCGATTGCCTCTCGGCCTGGGCaiATCGTCCCGGCTGCCGGTCGACGACCGCCCCGCGTCAT 
GCGGCTCCTCGGCTGGTGGCAAGTATTGCTGTGGGTGCTGGGACTTCCCGTCCGCGGCGTGG 
AGGTTGCAGAGGAAAGTGGTCGCTTATGGTCAGAGGAGCAGCCTGCTCACCCTCTCCAGGTG 
GGGGCTGTGTACCTGGGTGAGGAGGAGCTCCTGCATGACCCGATGGGCCAGGACAGGGCAGC 
AGAAGAGGCCAATGCGGTGCTGGGGCTGGACACCCAAGGCGATCACATGGTGATGCTGTCTG 
TGATTCCTGGGGAAGCTGAGGACAAAGTGAGTTCAGAGCCTAGCGGCGTCACCTGTGGTGCT 
GGAGGAGCGGAGGACTCAAGGTGCAACGTCCGAGAGAGCCTTTTCTCTCTGGATGGCGCTGG 
AGCACACTTCCCTGACAGAGAAGAGGAGTATTACACAGAGCCAGAAGTGGCGGAATCTGACG 
CAGCCCCGACAGAGGACTCCAATAACACTGAAAGTCTGAAATCCCCAAAGGTGAACTGTGAG 
GAGAGAAACATTACAGGATTAGAAAATTTCACTCTGAAAATTTTAAATATGTCACAGGACCT 
TATGGATTTTCTGAACCCAAACGGTAGTGACTGTACTCTAGTCCTGTTTTACACCCCGTGGT 
GCCGCTTTTCTGCCAGTTTGGCCCCTCACTTTAACTCTCTGCCCCGGGCATTTCCAGCTCTT 
CACTTTTTGGCACTGGATGCATCTCAGCACAGCAGCCTTTCTACCAGGTTTGGCACCGTAGC 
TGTTCCTAATATTTTATTATTTCAAGGAGCTAl^Ca^TGGCCAGATTTAATCATACAGATC 
GAACACTGGAAAOiCTGAAAATCTTCATTTTTAATC^GACAGGTATAGAAGCCAAGAAGAAT 
GTGGTGGTAACTCAAGCCGACCAAATAGGCCCTCTTCCCAGCACTTTGATAAAAAGTGTGGA 
CTGGTTGCTTGTATTTTCCTTATTCTTTTTAATTAGTTTTATTATGTATGCTACCATTCGAA 
CTGAGAGTATTCGGTGGCTAATTCCAGGACAAGAGCAGGAACATGTGGA GTAGT GATGGTCT 
GAAAGAAGTTGGAAAGAGGAACTTCAATCCTTCGTTTCAGAAATTAGTGCTACAGTTTCATA 
CATTTTCTCCAGTGACGTGTTGACTTGAAACTTCAGGCAGATTAAAAGAATCATTTGTTGAA 
CAACTGAATGTATAAAAAAATTATAAACTGGTGTTTTAACTAGTATTGCAATAAGCAAATGC 
AAAliATATTCAATAG 



FIGURE 165 



></usr/seqdb2/sst/DNA/Dnaseqs.min/ss.DNA48333
><subunit 1 of 1, 360 aa, 1 stop
><MW: 39885, pI: 4.79, NX(S/T): 7

MVPAAGRRPPRVMRLLGWWQVLLWLGLPWGVEVAEESGRLWSEEQPJ^PLQVGAVYLGEE 
ELLHDPMGQDRAAEEANAVLGLDTQGDHMVMLSVIPGEAEDKVSSEPSGVTCGAGGAEDSRC 
NVRESLFSLDGAGAHFPDREEEYYTEPEVAESDAAPTEDSNNTESLKSPKVNCEERNITGLE 
NFTLKILNMSQDLMDFLNPNGSDCTLVLFYTPWCRFSASLAPHFNSLPRAFPALHFLALDAS 
QHSSLSTRFGTVAVPNILLFQGAKPMARFNHTDRTLETLKIFIFNQTGIEAKKNVVVTQADQ 
IGPLPSTLIKSVDWLLVFSLFFLISFIMYATIRTESIRWLIPGQEQEHVE 

Important features: 
Signal peptide: 

amino acids 1-25 

Transmembrane domain:

amino acids 321-340 

Homologous region to disulfide isomerase
amino acids 212-302 

N-glycosylation site. 

amino acids 165-168, 181-184, 187-190, 194-197, 206-209, 278-281 
and 293-296 

Thioredoxin domain 

amino acids 211-227 



FIGURE 166 



CCCGGCTCCGCTCCCTCTGCCCCCTCGGGGTCGCGCGCCCACGATGCTGCAGGGCCCTGGCT 
CGCTGCTGCTGCTCTTCCTCGCCTCGCACTGCTGCCTGGGCTCGGCGCGCGGGCTCTTCCTC 
TTTGGCCAGCCCGACTTCTCCTACAAGCGCAGCSyiTTGCAAGCCCATCCCGGTCAACCTGCA 
GCTGTGCa^CGGCATCGAATACa^GAACATGCGGCTGCCCaACCTGCTGGGCCACGa.GACCA 
TGAAGGAGGTGCTGGAGCAGGCCGGCGCTTGGATCCCGCTGGTCATGAAGCAGTGCCACCCG 
GACACCAAGAAGTTCCTGTGCTCGCTCTTCGCCCCCGTCTGCCTCGATGACCTAGACGAGAC 
CATCCAGCCATGCCACTCGCTCTGCGTGCAGGTGAAGGACCGCTGCGCCCCGGTCATGTCCG 
CCTTCGGCTTCCCCTGGCCCGACATGCTTGAGTGCGACCGTTTCCCCCAGGACAACGACCTT 
TGCATCCCCCTCGCTAGCAGCGACCACCTCCTGCCAGCCACCGAGGAAGCTCCAAAGGTATG 
TGAAGCCTGCAAAAATAAAAATGATGATGACAACGACATAATGGAAACGCTTTGTAAAAATG
ATTTTGCACTGAAAATiy^AAGTGAAGGAGATAACCTACATCAACCGAGATACCAAAATCATC
CTGGAGACCAAGAGCAAGACCATTTACAAGCTGAACGGTGTGTCCGAAAGGGACCT^
ATCGGTGCTGTGGCTCAAAGACa^GCTTGCAGTGCACCTGTGAGGAGATGAACGACATCAACG
CX3CCCTATCTGGTCATGGGACAGAAACAGGGTGGGGAGCTGGTGATCACCTCGGTGAAGCGG
TGGCAGAAGGGGCAGAGAGAGTTCAAGCGCATCTCCCGCAGCATCCGCAAGCTGCAGTGCTA
GTCCCGGCATCCTGATGGCTCCGACAGGCCTGCTCCAGAGCACGGCTGACCATTTCTGCTCC
GGGATCTCAGCTCCCGTTCCCCAAGCACACTCCTAGCTGCTCCAGTCTCAGCCTGGGCAGCT
TCCCCCTGCCTTTTGCACGTTTGCATCCCCAGCATTTCCTGAGTTATAAGGCCACAGGAGTG 
GATAGCTGTTTTCACCTAAAGGAAAAGCCCACCCGAATCTTGTAGAAATATTCAAACTAATA 
AAATCATGAATATTTTAA 



FIGURE 167 



></usr/seqdb2/sst/DNA/Dnaseqs.min/ss.DNA50920
><subunit 1 of 1, 295 aa, 1 stop
><MW: 33518, pI: 7.74, NX(S/T): 0

MLQGPGSLLLLFIASHCCLGSARGLFLFGQPDFSYKRSNCKPIPVNLQLCHGIEYQNMRLPN 
LLGHETMKEVLEQAGAWIPLVMKQCHPDTKKFLCSLFAPVCLDDLDETIQPCHSLCVQVKDR 
Ca.PVMSAFGFPWPDMLECDRFPQDNDLCIPIASSDHLLPATEEAPKVCEACKNKNDDDNDIM 
ETLCKBTOFALKIKVKEITYINiyDTKIILETKSKTIYKLNGVSERDLKKSVLWLKDSLQCTCE 
EB4NDINAPYLVMGQKQGGELVITSVKRWQKGQREFKRISRSIRKLQC 

Important features: 
Signal peptide: 

amino acids 1-20 

Cysteine rich domain, homologous to frizzled N terminus
amino acids 6-153 



FIGURE 168 



GTGGAGGCCGCCGACGATGGCGGGGCCGACGGAGGCCGAGACGGGGTTGGCCGAGCCCCGGG 
CCCTGTGCGCGCAGCGGGGCCACCGCACCTACGCGCGCCGCTGGGTGTTCCTGCTCGCGATC 
AGCCTGCTCAACTGCTCC^CGCCACGCTGTGGCTCAGCTTTGCACCTGTGGCTGACGTCAT 
TGCTGAGGACTTGGTCCTGTCCATGGAGCAGATCAACTGGCTGTCACTGGTCTACCTCGTGG 
TATCCACCCCATTTGGCGTGGCGGCCATCTGGATCCTGQACTCCGTCGGGCTCCGTGCGGCG 
ACCATCCTGGGTGCGTGGCTGAACTTTGCCGGGAGTGTGCTACGCATGGTGCCCTGCATGGT 
TGTTGGGACCCAAAACCCATTTGCCTTCCTCATGGGTGGCCAGAGCCTCTGTGCCCTTGCCC 
AGAGCCTGGTCATCTTCTCTCCAGCCAAGCTGGCTGCCTTGTGGTTCCCAGAGCACCAGCGA 
GCCACGGCCAACATGCTCGCCACCATGTCGAACCCTCTGGGCGTCCTTGTGGCCAATGTGCT 
GTCCCCTGTGCTGGTCAAGAAGGGTGAGGACATTCCGTTAATGCTCGGTGTCTATACCATCC 
CTGCTGGCGTCGTCTGCCTGCTGTCCACCATCTGCCTGTGGGAGAGTGTGCCCCCCACCCCG 
CCCTCTGCCGGGGCTGCCAGCTCCACCTCAGAGAAGTTCCTGGATGGGCTCAAGCTGCAGCT 
CATGTGGAACAAGGCCTATGTCATCCTGGCTGTGTGCTTGGGGGGAATGATCGGGATCTCTG 
CCAGCTTCTCAGCCCTCCTGGAGCAGATCCTCTGTGCAAGCGGCCACTCCAGTGGGTTTTCC 
GGCCTCTGTGGCGCTCTCTTCATCACGTTTGGGATCCTGGGGGCACTGGCTCTCGGCCCCTA 
TGTGGACCGGACCAAGCACTTCACTGAGGCCACCAAGATTGGCCTGTGCCTGTTCTCTCTGG 
CCTGCGTGCCCTTTGCCCTGGTGTCCCAGCTGCAGGGACAGACCCTTGCCCTGGCTGCCACC 
TGCTCGCTGCTCGGGCTGTTTGGCTTCTCGGTGGGCCCCGTGGCCATGGAGTTGGCGGTCGA 
GTGTTCCTTCCCCGTGGGGGAGGGGGCTGCCACAGGCATGATCTTTGTGCTGGGGCAGGCCG 
AGGGAATACTCATCATGCTGGCAATGACGGCACTGACTGTGCGACGCTCGGAGCCGTCCTTG 
TCCACCTGCCAGCAGGGGGAG6ATCCACTTGACTGGACAGTGTCTCTGCTGCTGATGGCCGG 
CCTGTGCACCTTCTTCAGCTGCATCCTGGCGGTCTTCTTCCACACCCCATACCGGCGCCTGC 
AGGCCGAGTCTGGGQAGCCCCCCTCCACCCGTAACGCCGTGGGCGGCGCAGACTCAGGGCCG 
GGTGTGGACCGAGGGGGAGCAGGAAGGGCTGGGGTCCTGGGGCCCAGCACGGCGACTCCGGA 
GTGCACGGCGAGGGGGGCCTCGCTAGAGGACCCCAQAGGGCCCGGGAGCCCCCACCCAGCCT 
GCCACCGAGCGACTCCCCGTGCGCAAGGCCCAGCAGCCACCGACGCGCCCTCCCGCCCCGGC 
AGACTCGCAGGCAGGGTCO^GCGTCCAGGTTTATTGACCCGGCTGGGTCTCACTCCTCCTT 
CTCCTCCCCGTGGGTGATCACGTAGCTGAGCGCCTTGTAGTCCAGGTTGCCCGCCACATCGA 
TGGAGGCGAACTGGAACATCTGGTCCACCTGCGGGCGGGGGCGAAAGGGCTCCTTGCGGGCT 
CCGGGAGCGAATTACAAGC6CGCACCTGAAAA 



FIGURE 169 



></usr/seqdb2/sst/DNA/Dnaseqs.min/ss.DNA50988
><subunit 1 of 1, 560 aa, 1 stop
><MW: 58427, pI: 6.86, NX(S/T): 2

MAGPTEAETGLAEPRALCAQRGHRTYAEIRWVFLLAISLLNCSNATLWLSFAPVADVIAEDLV 

LSMEQINWLSLVYLWSTPFGVAAIWILDSVGLRAATILGAWLNFAGSVLRMVPCMWGTQN 

PFAFIMGGQSLCALAQSLVIFSPAKIiAALWFPEHQRATANMLATMSNPLGVLVAN^ 

KKGEDIPLMLGVYTIPAGWCLLSTICLWESVPPTPPSAGAASSTSEKFLDGLKLQLMWNKA 

YVILAVCLGGMIGISASFSALLEQILCASGHSSGFSGLCGALFITFGILGALALGPYVDRTK 

HFTEATKIGLCLFSLACVPFALVSQLQGQTLALAATCSLLGLFGFSVGPVAMELAVECSFPV 

GEGAATGMIFVLGQAEGILIMLAMTALTVRRSEPSLSTCQQGEDPLDWTVSLLLMAGLCTFF 

SCILAVFFHTPYRRLQAESGEPPSTKNAVGGADSGPGVDRGGAGRAGVLGPSTATPECTARG 

ASLEDPRGPGSPHPACHRATPRAQGPAATDAPSRPGRLAGRVQASRFIDPAGSHSSFSSPWVIT 

Important features:
Signal peptide: 

amino acids 1-44 

Transmembrane domains: 

amino acids 61-79, 98-112, 126-146, 169-182, 201-215, 248-268, 
280-300, 318-337, 341-357, 375-387, 420-441 

N-glycosylation sites.
amino acids 40-43 and 43-46 

Glycosaminoglycan attachment site. 

amino acids 468-471 



FIGURE 170 



TAGTlXrCCaGTAGGGGGTCTCCCCTGGa^TTCTTGATaSGCGTTTGGACaTC 
TGGCCraGCCTTGGGGTCCTGCTTGTTTCaTAATCaTCT^ 

TCTGaATCTAGCCCSlCTTGGCGGTJ^GCaTCATGCTACra 

C:TACTTATTTCTTTTAGGGGftTTGTCAGGAGGTGACCA.CTCT<:ACX3GTGAAATAC(:aAGTGTCAG^ 

ATCTGGTACAGTGATCGGGAAGCTGTCCCAGGAACTGGGCCGGGAGGAGAGGCGGAGGCAAGCTGGGGCCGCCTT 

CCAGGTGTTGCAGCTGCCTCAGGCGCTCCCCa.TTCAGGTGGACTCTGAGGAAGGCTTGCTCAGCA 

GCTGGATCGAGAGCAGCTGTGCCGACAGTGGQATCCCTGCCTGGTTTCCTTTGATGTGCTTGCCACAGGGGATTT 

GGCTCTGATCaVTGTGGAGATCa^GTGCTGGACATCAATGACO^CCAGCCmCGGTTTCCCAAAGG 

GCTGGAAATCrCTGAGAGCGCCTCTCTGCGAACCCGGATCCCCCTGGACa.GAGCTCTTGACCCAGACACaLGGC^ 

TAAC3iCCCTGCACACCTACaCTCTGTCTCCCAGTGAGCACTTTGCCTTGGATGTCa.TTGTGGGCCCTGATGAC^ 

CAAAaVTGa^GAACrcaTAGTGGTGAAGGaGCTGGACAGGGaAATCC^^ 

CTATGACSiATGGGAACCCCCCa^AGTCaGGTACCaGCTTGGTC^ 

CCCroCGTTTGCTGAGAGTT<aiCrGGCACTGGAAATCC3iAGAAGATGCTGC3VC^ 

GACCGCCAO^GACCCTGACCS^GGCCCCS^TGGGGAaSTGGAGTTCTTCCTCAGTAAGCaVCATGCCTCa^ 

GCTGGACACCTTCAGTATTGATGCCAAGACAGGCCAGGTCATTCTGCGTCGACCTCTAGACTATGAAAAGAACCC 

TGCCTACGAGGTGGATGTTCAGGCaAGGGACCTGGGTCCCAATCCTATCCCAGCCCATTGCS^GTTCTCATCaA 

GGTTCTGGATGTCAATGACaACATCCCaAGCATCa^CGTCACATGGGCCTCCCAGCCATCACTGGTGTC^ 

TCTTCCCAAGGAO^GTTTTATTGCTCTTGTCATGGCAGATGACTTGGATTCAGGAaiGAATG^ 

CTGGCTGAGCCAAGAGCTGGGCCACTTCAGGCTGAAAAGAACTAATGGCaACACATACATGTTGCTAACCa^^ 

CACa^CTGGACAGAGAGCaGTGGCCCaAATATACCCTCACTCTGTTAGCCCAAGACCAAC^ 

AGCO^GAAACAGCTa^GCATTCSVGATCaGTGACaTa^ 

AGTCrCCACGCGGGAAAAa^CTTACCCTCTCracaa:TCaTTACCAT 

TAATGGAAAAGTCTCATACCGGATCCaGGACTCCCa^GTTGCTa^CTTAGTAGCTATTGACTCCa^ 

GGTCACTGCTCAGAGGTCACTGAACTATGAAGAGATGGCOKSCTTTGAGTTCCAGGTGATCGCA^^ 

GCaACCCATGCTTGOlTCCAGTGTCTCTGTGTGGGTaVGCCTCTTGGATGCCAATGATAATGCCCCAQAGGTGGT 

CCAGCCTGTGCTCAGCGATGGAAAAGCaiGCCTCTCCGTGCTTGTGAATGCCTCCACAGGCCACCTGCTGGTGCC 

CATCGAGACTCCCaATGGCTTGGGCCCAGCGGGCACTGACACaCCTCCACTGGCCACTCACTlGCTCCC^ 

CCTTTTGACAACCaTTGTGGCJ^GAGATGCAGACrCGGGGGCAAATGGAGAGCCCCTCTACaG 

AAATGAAGCCCACCTCTTCATCCTCAACCCTCATACGGGGCAGCTGTTCGTCaATGTCACCS^TGCCAGCA^ 

CaTTGGGAGTGAGTGGGAGCTGGAGATAGTAGTAGAGQACCAGGGAAGCCCCCCCTTACAGACCCQ^ 

GAGGGTCATGTTTGTCACCAGTGTGGACCACCrGaGGGACTa^CCCGC^ 

GCTGAa3»rGATCTGCCTGGCTGTACroTTGGGa^TCfrTCX3^ 

GACAGAAAAGAAGGACAACAGGGCCTACAACTGTCGGGAGGCCGAGTCCaLCCrACCGCCAGCA^ 

CCaGAAACACArraVGAAGGCaGACATCCaCCTCGTGCCTGTGCTCAGGGGTCAGGCAGGTGAGC 

CGGGCAGTCCCACAAAGATGTGGACAAGGAGGCGATGATGGAAGCS^GGCTGGGACCCCTGCCTGCAGGCCCCCTT 

CCACCTa^CCCCGACCCTGTACAGGACGCTGCGTAATCAAGGCAACCAGGGAGa^CCGGCGGAGAGCCGAGA 

GCTGCAAGACACGGTO^CCTCCTTTTCAACCATCCCAGGCAGAGGAATGCOTCCCGGGAGAACCTGAACCTTCC 

CGAGCCCCAGCCTGCCaLCAGGCCAGCCACGTTCCAGGCCTCTGAAGGTTGCAGGCAGCCCCa^CAGGGAGGCTGGC 

TGGAGACaiGGGa^GTGAGGAAGCCCCACAGAGGCCACCAGCCTCCTCTGaU!.CCCTGAGACGGCAGCGAC^ 

TGCCTTCGCCGAGOSGAACCCCGTGGAGGAGCTCa^CTGTGGATTCTCCTCCTGTTCAC^^ 

GTCCTTGCTGCa^TCayGGGCCaATTCaWXICCAAACCAAACCAC^^ 

CaGCAGGAGTGO^TCCCAGACaCAGATGGCCCS^GTGCAAGGGCTGGAGGCCaG^ 

AGGGCCTTTGGATCCTGAAGAGGACCTCTCTGTGAAGCAACTGCTAGAAGAAGAGCTGTCAAGTCTGCTGGACCC 

CAGCACAGGTCTGGCCCTGGACCGGCTGAGCGCCCCTGACCCGGCCTGGATGGCGAGACTCTCTTTGCCCCTCAC 

CACCAACTACCGTGAGAATGTGATCTCCCCGGATGCTGCAGCCACGGAGGAGCCGAGGACCTTCCAGACGTTCGG 

C^iAGGCAGAGGCACCaGAGCTGAGCCCAACAGGCACGAGGCTGGCCaGCACCTTTGTCTCGGAGATGAGCTCACT 

GCTGGAGATGCTGCTGGAACAGCGCTCCAGCATGCCCGTGGAGGCCGCCTCCGAGGCGCTGCGGCGGCTCTCGGT 

CTGCGGGAGGACCCTCAGTTTAQACTTGGCCACCAGTGCaGCCTCaGGCATGAAAGTGCAAGGGGACCCAGGTGG 

AAAGACGGGGACTGAGGGCAAGAGCAGAGGCAGCa^GCa^GCaVGCAGCAGGTGCCTGTGA^ 

CTGG3^TCCAAGAACCaGGGGCCTGAGGATCTGTGGAa«VGAGCTGGTTTCTAAAATCra 

CGGCSSGCCTGAGAACTTTAGGGTGACTGATGCTAO^CCa^CAGAGGAGGC^^ 

TGACaU^GCAGCCCrTTGTAAGCaGCrCTGAGTCTTTTGGAGGAa^^ 

TCCTGGCS^AAACS^TATGTGGAGCACAAAGGGTCAGTCCrCTGGCAGAAa^ 

AAAGGGTGGCCTTCTTGGGTAGCAGGAGTCAGGGGGCrGTACCCrGGGGGTGCC^^ 

CAATAAAGGAAAAGCA6TAA2VAAAAAAAAAAAAAAAAA 
></usr/seqdb2/sst/DNA/Dnaseqs.min/ss.DNA48331
><subunit 1 of 1, 1184 aa, 1 stop
><MW: 129022, pI: 5.20, NX(S/T): 5

MMQLLQLLLGLLGPGGYLFLLGDCQEVTTLTVKYQVSEEVPSGTVIGKLSQELGREERRRQA 
GAAFQVLQLPQALPIQVDSEEGLLSTGRRLDREQLCRQWDPCLVSFDVLATGDLALIHVEIQ 
VLDINDHQPRFPKGEQELEISESASLRTRIPLDRALDPDTGPNTLHTYTLSPSEHFALDVIV 
GPDETKHAELIVVKELDREIHSFFDLVLTAYDNGNPPKSGTSLVKVNVLDSNDNSPAFAESS 
LALEIQEDAAPGTLLIKLTATDPDQGPNGEVEFFLSKHMPPEVLDTFSIDAKTGQVILRRPL 
DYEKNPAYEVDVQARDLGPNPIPAHCKVLIKVLDVNDNIPSIHVTWASQPSLVSEALPKDSF 

Pl lALVMADDLDSGHNGLVHCWLSQELGHFRLKRTNGNTYMLLTNATLDREQWPKYTLTL 

QGLQPLSAKKQLSIQI SDINDNAPVFEKSRYEVSTRENNLPSLHLITIKAHDADLGINGKVS 
YRIQDSPVAHLVAIDSNTGEVTAQRSLNYEEMAGFEFQVIAEDSGQPMLASSVSVWVSLLDA 

NDNAPEWQPVLSDGKASLSVLVN7VSTGHLLVPIETPNGLGPAGTDTPPLATHSSRPFLLTT
IVARDADSGANGEPLYSIRNGNEAHLFILNPHTGQLFVNVTNASSLIGSEWELEIWEDQGS
PPLQTRALLRVMFVTSVDHLRDSARKPGALSMSMLTVICLAVLLGIFGLILALFMSICRTEK
KDNRAYNCREAESTYRQQPKRPQKHIQKADIHIA7PVLRG0AGEPCEVGQSHKDVDK^^
GWDPCLQAPFHLTPTLYRTLRNQGNQGAPAESREVLQDTVNLIiFNHPRQRNASRENLNLPEP
QPATGQPRSRPLKVAGSPTGRLAGDQGSEEAPQRPPASSATLRRQRHLNGKVSPEKESGPRQ
ILRSLVRLSVAAFAERNPVEELTVDSPPVQQISQLLSLLHQGQFQPKPNHRGNKYLAKPGGS
RSAIPDTDGPSARAGGQTDPEQEEGPLDPEEDLSVKQLLEEELSSLLDPSTGIjALDRLSAPD 
PAWMARLSLPLTTNYRDNVISPDAAATEEPRTFQTFGKAEAPELSPTGTRLASTFVSEMSSL 
LEMLLEQRSSMPVEAASEALRRLSVCGRTLSLDLATSAASGMKVQGDPGGKTGTEGKSRGSS 
SSSRCL 



Important features: 
Signal peptide: 

amino acids 1-13 
Transmembrane domain: 

amino acids 719-739 
N-glycosylation site. 

amino acids 415-418, 582-585, 659-662, 662-665 and 857-860

Cadherins extracellular repeated domain signature. 

amino acids 123-133, 232-242, 340-350, 448-458 and 553-563 



FIGURE 172 



CGGACGCGTGGGCGGACGCGTGGGGGAGAGCCGCAGTCCCGGCTGCAGCACCTGGGAGAAGG 
CAGACCGTGTGAGGGGGCCTGTGGCCCCAGCGTGCTGTGGCCTCGGGGAGTGGGAAGTGGAG 
GCAGGAGCCTTCCTTACACTTCGCCATGAGTTTCCTCATCGACTCCAGCATCATGATTACCT 
CCCAGATACTATTTTTTGGATTTGGGTGGCTTTTCTTCATGCGCCAATTGTTTAAAGACTAT 
GAGATACGTCAGTATGTTGTACAGGTGATCTTCTCCGTGACGTTTGCATTTTCTTGCACCAT 
GTTTGAGCTCATCATCTTTGAAATCTTAGGAGTATTGAATAGCAGCTCCCGTTATTTTCACT 
GGAAAATGAACCTGTGTGTAATTCTGCTGATCCTGGTTTTCATGGTGCCTTTTTACATTGGC 
TATTTTATTGTGAGCAATATCCGACTACTGCATAAACAACGACTGCTTTTTTCCTGTCTCTT 
ATGGCTGACCTTTATGTATTTCTTCTGGAAACTAGGAGATCCCTTTCCCATTCTCAGCCCAA 
AACATGGGATCTTATCCATAGAACAGCTCATCAGCCGGGTTGGTGTGATTGGAGTGACTCTC 
ATGGCTCTTCTTTCTGGATTTGGTGCTGTCAACTGCCCATACACTTACATGTCTTACTTCCT 
CAGGAATGTGACTGACACGGATATTCTAGCCCTGGAACGGCGACTGCTGCAAACCATGGATA 
TGATCATAAGCAAAAAGAAAAGGATGGCAATGGCACGGAGAACAATGTTCCAGAAGGGGGAA 
GTGCATAACAAACCATCAGGTTTCTGGGGAATGATAAAAAGTGTTACCACTTCAGCATCAGG 
AAGTQAAAATCTTACTCTTATTCAACAGGAAGTGGATGCTTTGGAAGAATTAAGCAGGCAGC 
TTTTTCTGGATIACAGCTGATCTATATGCTACCAAGGAGAGAATAGAATACTCCAAAACCTTC 
AAGGGQAAATATTTTAATTTTCTTGGTTACTTTTTCTCTATTTACTGTGTTTGGAAAATTTT 
CATGGCTACCATCAATATTGTTTTTGATCGAGTTGGGAAAACGGATCCTGTCACAAGAGGCA 
TTGAGATCACTGTGAATTATCTGGGAATCCAATTTGATGTGAAGTTTTGGTCCCAACACATT 
TCCTTCATTCTTGTTGGAATAATCATCGTCACATCCATCAGAGGATTGCTGATCACTCTTAC 
CAAGTTCTTTTATGCCATCTCTAGCAGTAAGTCCTCCAATGTCATTGTCCTGCTATTAGCAC 
AGATAATGGGCATGTACTTTGTCTCCTCTGTGCTGCTGATCCGAATGAGTATGCCTTTAGAA 
TACCGCACCATAATCACTGAAGTCCTTGGAGAACTGCAGTTCAACTTCTATCACCGTTGGTT 
TGATGTGATCTTCCTGGTCAGCGCTCTCTCTAGCATACTCTTCCTCTATTTGGCTCACAAAC 
AGQCACCAGAGAAGCAAATGGCACC TTGAA CTTAAGCCTACTACAGACTGTTAGAGGCCAGT 
GGTTTCAAAATTTAGATATAAGAGGGGGGAAAAATGGAACCAGGGCCTGACATTTTATAAAC 
A2\ACAAAATGCTATGGTAGCATTTTTCACCTTCATAGCATACTCCTTCCCCGTCAGGTGATA 
CTATGACCATGAGTAGCATO^GCCAGAACATGAGAGGGAGAACTAACTCAAGACAATACTCA 
GCAGAGAGCATCCCGTGTGGATATGAGGCTGGTGTAGAGGCGGAGAGGAGCCAAGAAACTAA 
AGGTGAAAAATACACTGGAACTCTGGGGCAAGACATGTCTATGGTAGCTGAGCCAAACACGT 
AGGATTTCCGTTTTAAGGTTCACATGGAAAAGGTTATAGCTTTGCCTTGAGATTGACTCATT 
AAAATCAGAGACTGTAACAAAAAAAAAAAAAAAAAAAAAGGGCGGCCGCGACTCTAGAGTCG 
ACCTGCAGAAGCTTGGCCGCCATGGCCCAACTTGTTTATTGCAGCTTATAATG 



FIGURE 173 



MSFLIDSSIMITSQILFFGFGWLPFMRQLFKDyEIRQYWQVIFSVTFAFSCTMFELIIFEI 

IiGVIJSrSSSRYFHWKMNLCVILLILVFMVPFYIGYFIVSNIRLLHKQRLLFSCLL^ 

WKLGDPFPILSPKHGILSIEQLISRVGVIGVTIJyiALLSGFGAWCPYTyMSyFLRNVTDTDI 

IJU^ERRLI.QT^mMIISKKKRMAMMlRTMFQKGEVHNKPSGFWGMIKSVTT^^ 

QEVDALEELSRQLFLETADLYATKERIEYSKTFKGKYFNFLGYFFSIYCVWKIFMATINIVF 

DRVGKTDPVTRGIEITVNYLGIQFDVKFWSQHISFILVGIIIVTSIRGLLITLTKFFYAISS 

SKSSNVIVLLIAQIMGMYFVSSVLLIRMSMPLEYRTI ITEVLGELQFNFYHRWFDVI FLVSA 

LSSILFLYLAHKQAPEKQMAP 

Important features: 
Signal peptide: 
amino acids 1-23 

Potential transmembrane domains: 

amino acids 37-55, 81-102, 150-168, 288-311, 338-356, 375-398, 
425-444 

N-glycosylation sites. 

amino acids 67-70, 180-183 and 243-246 

Eukaryotic cobalamin-binding proteins 

amino acids 151-160 
CATGGGAAGTGGAGCCGGAGCCTTCCTTACak.CTCGCCATGAGTTTCCT(::aiTCGACTCCAGCA
TCATGATTACCTCCCaJGANACTATTTTTTGGATTTGGGTGGCTTTTCTTaSGCGCCJ\A.TGTT 
TAAAGACTATGAGATACGTCAGTATGTTGTACNGGTGATCTTCTCCGTGACGTTTGCCATTT 
CTTGCACCATGTTTGAGCTCATCATCTTTGA2^TCTTNGGAGTATTGAATAGCAGCTCCCGT 
TATTTTCACTGGAAAATGAACCTGTGTGTAATTCTGCTGATCCTGGTTNTCATGGTGCCTTT 
TTACATTGGCTATTTTATTGTGAGCAATATCCGACTACTGCATAAACAACGACTGCTTTTTT 
CCTGTCTCTTATGGCTGACCTTTATGTATTTCCAG 
GTGTTGCCCTTGGGGAGGGGAAGGGGAGCCNGGCCCTTTCCTAAAATTTGGCCAAGGGTTTC

TTTNTTGAATTCCGGGTTNNGNATACCTTCCCAGAAAATATTTTTTGGATT^ 

TTTTTCATGCGCCAATTGTTTAAAGACTATGAGATACGTCAGTATGTTGTACAGGTGATNTT 

NTCCGTGACGTTTGCATTTTCTTGCACCATGTTTGAGCTCATCATNTTTGAAATNTTAGGAG 

TATTGAATAGCAGCTCCCGTTATTTTCACTGGAAAATGAACCTGTGTGTAATTCTGCTGATC 

CTGGTTTTCATGGTGCCTTTTTACATTGGCTATTTTATTGTGAGCAATATCCGACTACTGCA 

TAAACAACGACTGCTTTTTTCCTGTCTNTTATGGCTGACCTTTATGTATTTNTTNTGGAAAN 

TAGGAGATCCCTTTCCCATTCTC 



FIGURE 176 



CTCGCGCAGGGATCGTCCCaSGGCCGGGGCTCGGAGCCGCGACCCTTGGGGGGCCTCCGGGATTTGCTAC^ 

TGGCTCCCTGCTCGTCGAACTGCTCTTCTCACGGGCTGTCGCCTTCAATCTGGACGTGATGGGTGCCTTGCGCAA 

GGaGGGCGAGCCAGGCaG(XrrCTTCX3GCTTCTCTGTGGCCCTaZACCGGC^ 

CCCGTTGAGCCTGGaGGAGACTGACTGOTACaQAGTGGACATCGACCAGGGAGCT^^ 

GGAGAACCAGTGGTTGGGAGTCAGTGTTCGGAGCCAGGGGCCTGGGGGCaAGATTGTTACCTGTGCACACCGATA 

TGAGG<:aAGGCAGCGAGTGGACCAGATCCTGGAGACGCGGGATATGATTGGTCGCTGCTTTGTGCT(:a.GC(^ 

CCTGGCCATCCGGGATGAGTTGGATGGTGGGGAM-GGaAGTTCTGTGAGGGACGCCCCO^GGCCATGJ^C^ 

TGGGTTCTGCCAGCAGGGCACAGCTGCCGCCTTCTCCCCTGATA6CCACTACCTCCTCTTTGGGGCCCCAGGAAC 

CTATAATTGQZiAGGGCACGGCCAGGGTGGAGCTCTGTGCACAGGGCTCAGCGGACCTGGCACACCTGGACGACGG 

TCCCTACGAGGCGGGGGGAGAGAAGGAGOlGGACCCCCGCCTCATCCCGGTCCCTGCCAAa^GCTACTTTGGCTT 

CrCTATTGACTa3GGGAAAGGTCTGGTGCGTGCAGAAGAGCTGAGCTTTGTGGCTGGAGCCCCCCGaK;C^ 

CAAGGGTGCTGTGGTCaTCCroCGCaAGGACAGCXKICaGTaXIC^^ 

CCTGACCTCCXSGCrrTGGCTACTCACTGGCTGTGGOTGACCrcaA^^ 

TGCCCCCTACTTCTTTGAGCGCO^GAAGAGCTGGGGGGTGCTGTGTATGTGTACr^^ 

GGCTGGGATCTCCCCTCTCCGGCTCTGCGGCTCCCCTGACTCCATGTTCGGGATCAGCCTGGCTGTCCTGGGGGA 

CCTCaUi.CCAAGATGGCTTTCaVGATATTGa^GTGGGTGCCCCCrTTGATGGTGATGGGAAAGTCTTCATCTACCA 

TGGGAGCAGCCTGGGGGTTGTCGCCAAACCTTCACAGGTGCTGGAGGGCGAGGCTGTGGGCATCaAGAGCTTC 

CTACTCCCTGTCAGGCAGCTTGGATATGGATGGGAACCAATACCCTGACCTGCTGGTGGGCTCCCTGGCTGACAC 

CGCAGTGCTCTTO^GGGCCS^GACCCATCCTCCATGTCrCCCATGAGGTCTCTATTGCTCCACGAAGCATCGACCT 

GGAGCaGCCCAACTGTGCTGGCGGCCACTCGGTCTGTGTGGACCTAAGGGTCTGTTTCAGCTACATTGCAGTCCC 

CAGCAGCTATAGCCCTACTGTGGCCCTGGACTATGTGTTAGATGCGGACaCAGACCGGAGGCTCCGGGGCCAGGT 

TCCCCCTGTGACGTTCCTGAGCCGTAACCTGGAAGftACCCAAGCACCaGGCCTaSGGCACCGTG^^ 

CCAGCATGACCGAGTCTGTGQAGACGCCATGTTCCAGCTCCAGGA?UUVT^ 

AGTCACCTTGTCCTACAGTCTCCaGACCCCTCGGCTCCGGCXSACaGGCT 

CCCCATCCTO^TGCCCaCCAGCCCAGCACCCAGCGGGCAGAGATCCACTTCCTGAAGC^ 

CAAGATCTGCCAGAGCAATCTGaVGCTGGTCCACGCCCGCTTCTGTACCCGGGTCAGCGACACGGAATTCC^ 

TCTGCCCATGGATGTGGATGGAACAACAGCCCTGTTTGCACTGAGTGGGCAGCCAGTCATTGGCCTGGAGCTGAT 

GGTCACCAACCTGCCATCGGACCCAGCCCAGCCCCAGGCTGATGGGGaTGATGCCCATGAaJSCCa^^ 

TGaGAATGCCTCCa^TGTTGAGTGTGAGCTGGGGAACCCCATGAAGAGAGGTGCCCAGOTCACCTTCTACCT 

(XOTAGCACCTCCGGGATCaGa^TTGaGACai(XGaACTGGAGGTAGAGCTGC^^ 

GGAGCTGCATCCAGTCTCTGCACGaGCCCGT6TCTTaATTGAGCTGCa>LCTGTC^^ 

CO^GCAACTCTTCTTCTCTGGTGTGGTGAGGGGCXSAGAGAGCCATGCAGTCrGAGCX^^ 

CAAGTATGAGGTCACGGTTTOiaACCaAGGCCS^GTaSCTCaGAACCCTGGGCTC^^ 

GCCTCATGAGATTGCCaATGGGAAGTGGTTGCTGTACCCAATGCAGGTTGAGCTGGAGGGCGGGCmGGGGCCTG^ 
GCAGAAAGGGCTTTGCTCTCCCAGGCCC?Ui.C:ATCCTCCa^CCTGGATGTGGACAGTAGGGA 

GCTGGAGCCACCTGAGCykGCAGGAGCCTGGTGAGCGGCAGGAGCCCAGCATGTCCTGGTGGCCAGTGTCCTCTGC 

TGAGaAGAAGAAAAACaTCaLCCCTGQACTGCGCCCGGGGCACGGCCAACTGTGTGGTGTTCAGCTGCCC^ 

CAGCTTTGACCGCGCXSGCTGTGCTGCaVTGTCTCGGGCCGTCTCTGGAACAGa^ 

TGTGAAGTCCCTGGAAGTGATTGTCCXSGGCCaACATCaVCAGTGAAGTCCTCCaTAAAGi^^ 

TGCOTCCAO^GTGATCCCAGTGATGGTATACTTGGACCCCaTGGCTGTCGTGGCAG 

CATCCTCCTGGCTGTACTGGCTGGGCTGCTGGTGCTAGCSICTGCTGGTGCTGCTCCTGT 

CAAACGGGCGAAGCacCCCGAGGCCACCGTGCCCCAGTACCATGCGGTGAAGATTCCTCGGG^ 

GTTCaAGGAGGAGAAGACGGGCACCATCCTGAGGAACAACTGGGGCAGCCCCCGGCGGGAGGGCCaSGATGC^ 

CCCCATCCTGGCTGCTGACGGGCATCCCGAGCTGGGCCCCGATGGGCy^TCCAGGGCCAGGCaCaSCCSAgGTTCC 

CaTGTCCCaLGCCTGGCCTGTGGCTGCCCTCO^TCCCTTCCCCAGAGATGGCTCCTTGGGATQAAGAGGGTAGAGT 

GGGCTGCTGGTGTCGCATCAAGATTTGGCAGGATCGGCTTCCTCa.GGGGC::ACaGACCTCTCCCACCa^ 

TCCTCCCaCCCT^CTTCCCCTTAGAGTGCTGTGAGATGAGAGTGGGTAAATCAGGGACAGGGCCATGGGGTAGG^ 

TGAGAAGGGCaGGGGTGTCCTGATGa^GGTGGGGAGAAGGGATCCTAATCCCTTC^^ 

GTAACSVGGACCCCAAGGACCTGCCTCCCCGGAAGTGOriTAACCT^ 

CTCRGGCTGCTCCTTCrCTAGTTTCCCCTCTCATCTCaCCT^^ 

TTCGTCrATTTATTAAAAAATATTTGAGAACAAAAAAAJ^AAAAAftAAAaAA 



FIGURE 177 

></usr/seqdb2/sst/DNA/Dnaseqs.min/ss.DNA55737
><subunit 1 of 1, 1141 aa, 1 stop
><MW: 124671, pI: 5.82, NX(S/T): 5

MAGARSRDPWGASGICYLFGSLLVELLFSRAVAFNLDVMGALRKEGEPGSLFGFSVALHRQL 
QPRPQSWLLVGAPQALALPGQQANRTGGLFACPLSLEETDCYRVDIDQGADMQKESKENQWL 
GVSVRSQGPGGKIVTCaHRYEARQRVDQILETRDMIGRCFVLSQDLAIRDELDGGEWKFCEG 
RPQGHEQFGFCQQGTAAAFSPDSHYLLFGAPGTYlIWKGTARVELCAQGSADLAHLDDGPyEA 
GGEKEQDPRLIPVPANSYFGFSIDSGKGLVRAEELSFVAGAPRANHKGAWILRKDSASRLV 
PEVMLSGERLTSGFGYSLAVADLNSDGWPDLIVGAPYFFERQEELGGAVYVYLNQGGHWAGI 
SPLRLCGSPDSMFGISLAVLGDLNQDGFPDIAVGAPFDGDGKVFIYHGSSLGWAKPSQVLE 
GEAVGIKSFGYSLSGSLDMDGNQYPDLLVGSLADTAVLFRARPILHVSHEVSIAPRSIDLEQ 
PNCAGGHSVCVDLRVCFSYIAVPSSYSPTYALDYVLDADTDRRLRGQVPRVTFLSRNLEEPK
HQASGTWLKHQHDRVCGDAMFQLQENVKDKLRAIVVTLSYSLQTPRLRRQAPGQGLPPVAP
ILNAHQPSTQRAEIHFLKQGCGEDKICQSNLQLVHARFCTRVSDTEFQPLPMDVDGTTALFA
LSGQPVIGLELMVTNLPSDPAQPQADGDDAHEAQLLVMLPDSLHYSGVRALDPAEKPLCLSN
ENASPT^ECELGNPMKRGAQVTFYLILSTSGISIETTELEVELLLATISEQELHPVSARARVF
IELPLSIAGMAIPQQLFFSGVVRGERAMQSERDVGSKYKYEVTVSNQGQSLRTLGSAFLNIM
WPHEIANGKWLLYPMQVELEGGQGPGQKGLCSPRPNILHLDVDSRDRRRRELEPPEQQEPGE
RQEPSMSWWPVSSAEKKKNITLDCARGTANCWFSCPLYSFDRAAVLHVWGRLWNSTFLEEY
SAVKSLEVIVRANITVKSSIKNLMLRDASWIPVMVYLDPMAWAEGVPWWVILLAVLAGLL
VIJ^LVLLLWKMGFFKRAKHPEATVPQYHAVKIPREDRQQFKEEKTGTILRinJWGSPRREGP
DAHPIIiAADGHPELGPDGHPGPGTA

Important features:
Signal peptide:

amino acids 1-33 



Transmembrane domain:
amino acids 1040-1062 



N-glycosylation sites. 

amino acids 86-89, 746-749, 949-952, 985-988 and 1005-1008 



Integrins alpha chain proteins. 

amino acids 1064-1071, 384-408, 1041-1071, 317-346, 443-465, 385- 
407, 215-224, 634-647, 85-99, 322-346, 470-479, 442-466, 379-408 
and 1031-1047 



FIGURE 178 



CGCGCCGGGCGCAGGGAGCTGAGTGGACGGCTCGAGACGGCGGCGCGTGCAGCAGCTCCAGA 

AAGCAGCGAGTTGGCAGAGCAGGGCTGCy^TTTCCAGa^GGAGCTGCGAGCACAGTGCTGGCT 

CACaACAAGATGCTCAAGGTGTCSlGCCGTACTGTGTGTGTGTGCAGCCGCTTGGTGCAGTCA 

GTCTCTCGCAGCTGCCGCGGCGGTGGCTGCAGCCGGGGGGCGGTCGGACGGCGGTAATTTTC 

TGGATGATAAACAATGGCTCACCACAATCTCTCA6TATGACAAGGAAGTCGGACAGTGGAAC 

AAATTCCGAQACGAAGTAGAGGATGATTATTTCCGCACTTGGAGTCCAGGAAAACCCTTCGA 

TCAGGCTTTAGATCCAGCTAAGGATCCATGCTTAAAGATGAAATGTAGTCGCCATAAAGTAT 

GCATTGCTCAAGATTCTCAGACTGCAGTCTGCATTAGTCACCGGAGGCTTACACACAGGATG 

AAAGAAGCAGGAGTAGACCATAGGCAGTGGAGGGGTCCCATATTATCCACCTGCAAGCAGTG 

CCCAGTGGTCTATCCCAGCCCTGTTTGTGGTTCAGATGGTCATACCTACTCTTTTCAGTGCA 

AACTAGAATATCAGGCATGTGTCTTAGGAAAACAGATCTCAGTCAAATGTGAAGGACATTGC 

CCATGTCCTTCAGATAAGCCCACCAGTACAAGCAGAAATGTTAAGAGAGCATGCAGTGACCT 

GGAGTTCAGGGAAGTGGCAAACAGATTGCGGGACTGGTTCAAGGCCCTTCATGAAAGTGGAA 

GTCAAAACAAGAAGACAAAAACATTGCTGAGGCCTGAGAGAAGCAGATTCGATACCAGCATC 

TTGCCAATTTGCAAGGACTCACTTGGCTGGATGTTTAACAGACTTGATACAAACTATGACCT 

GCTATTGGACCAGTCAGAGCTCAGAAGCATTTACCTTGATAAGAATGAACAGTGTACCAAGG 

CATTCTTCAATTCTTGTGACAC^TACAAGGACAGTTTAATATCTAATAATGAGTGGTGCTAC 

TGCTTCCAGAQACAGCAAGACCCACCTTGCCAGACTGAGCTCAGCAATATTCAGAAGCGGCA 

AGGGGTAAAGAAGCTCCTAGGACAGTATATCCCCCTGTGTGATGAAGATGGTTACTACAAGC 

CAACACAATGTCATGGCAGTGTTGGACAGTGCTGGTGTGTTGACAGATATGGAAATGAAGTC 

ATGGGATCCAGAATAAATGGTGTTGCAGATTGTGCTATAGATTTTGAGATCTCCGGAGATTT 

TGCTAGTGGCGATTTTCATGAATGGACTGATGATGAGGATGATGAAGACGATATTATGAATG 

ATGAAGATGAAATTGAAGATGATGATGAAGATGAAGGGGATGATGATGATGGTGGTGATGAC 

CT^TGATGTATAC^TTTGATTGATGACAGTTGAAATCa^TAAATTCTACATTTCTAAT^^ 

CAAAAATGATAGCCTATTTAAAATTATCTTCTTCCCCAATAACAAAATGATTCTAAACCTCA 

CATATATTTTGTATAATTATTTGAAAAATTGCAGCTAAAGTTATAGAACTTTATGTTTAAAT 

AAGAATCATTTGCTTTGAGTTTTTATATTCCTTACACAAAAAGAAAATACATATGCAGTCTA 

GTCAGACAAAATAAAGTTTTGAAGTGCTACTATAATAAATTTTTCACGAGAACAAACTTTGT 

AAATCTTCCATAAGCAAAATGACAGCTAGTGCTTGGGATCGTACATGTTAATTTTTTGAAAG 

ATAATTCTAAGTGAAATTTAAAATAAATAAATTTTTAATGACCTGGGTCTTAAGGATTTAGG 

AAAAATATGCATGCTTTAATTGCATTTCCAAAGTAGCATCTTGCTAGACCTAGATGAGTCAG 

GATAACAGAGAGATACCACATGACTCCAAAAAAAAAAAAAAA 



FIGURE 179

></usr/seqdb2/sst/DNA/Dnaseqs.min/ss.DNA49829
><subunit 1 of 1, 436 aa, 1 stop
><MW: 49429, pI: 4.80, NX(S/T): 0

MLKVSAVLCVCAAAWCSQSLAAAAAVAAAGGRSDGGNFLDDKQWLTTISQYDKEVGQWNKPR 
DEVEDDYFRTWSPGKPFDQALDPAKDPCLKMKCSRHKVCIAQDSQTAVCISHRRLTHRMKEA 
GVDHRQWRGPILSTCKQCPWYPSPVCGSDGHTYSFQCKLEYQACVLGKQISVKCEGHCPCP 
SDKPTSTSRNVKRACSDLEFREVANRLRDWFKALHESGSQNKKTKTLLRPERSRFDTSILPI 
CKDSLGWMFNRLDTim)LLLDQSELRSIYLDKNEQCTKAFFNSCDTYKDSLISNNEWCYCFQ 
RQQDPPCQTELSNI QKRQGVKKLLGQYI PLCDEDGYYKPTQCHGSVGQCWCVDRYGNEVMGS 
RINGVADCAIDFEISGDFASGDFHEWTDDEDDEDDIMBTOEDEIEDDDEDEGDDDDGGDDHD 

Important features: 
Signal peptide: 

amino acids 1-16 

Leucine zipper pattern.
amino acids 246-267 

N-myristoylation sites. 

amino acids 357-362, 371-376 and 376-381 

Thyroglobulin type-1 repeat proteins

amino acids 353-365 and 339-352 



FIGURE 180 



C3WSACTCCAGATTTCCCrGTC3^CCa^CGaGGAGTCaiLGi«mGGAAA.C^ 
CTCTTTCaGCCOKSGATCGCCCCaGCJaGGGaTOGGCraaCA^ 

GCTCTGCCTCCGGTGCTGCTGCCTGGGGCGGCCGGCTTCAa^CCrTCCCTaSATJWSaaAC^ 

CCCGCCGGCCaiGAAGGAGTGCTTCTAC(:»,GCCai,TGCCCCTOAAGGCCTCGCTGGAGATC^ 

GATGGAGCAGGATTAGATATTGATTTCCATCTTGCCTCrCCAGaAGGCAAAACCTTAGT^ 

TCAGATGGAGTTCaCACTGTAGAGACTGAAGTTGGTGAraACATGTTCTGCTTTGACAATACaTTGAG^ 

TCTGAGAAGGTGATTTTCTTTGAATTAATCCTGGATAATATGGGaGAAOVGGCACaAGAACAAGAAGATTGGAAG 

AAATATATTACTGGCaCAGATATATTGGATATGAAACTGGAAGACATCCTGGAATCCATCaACAGCATCAAGTCC 

AGACTAAGCAAAAGTGGGCACATACAAATTCTGCTTAGAGaiTTTGAAGCTCGTGATCGAAACATAC^ 

AACTTTGATAGaGTCAATTTCTGGTCTATGGTTAATTTAGTGGTCATGGTGGTGGTGTCAGCCATTCAAGTTTAT 

ATGCTGAAGaGTCTGTTTGftAGATAAGAGGAAAAGTAGAAC TTAAA ACTCCaAACTAGAGTACGTAACATTGAAA 

AATGAGGCATAAAAATGCAATAAACTGTTACAGTCAAGACCATTAATGGTCTTCTCCaAAATATTT^ 

AAAGTAGGAAAOlGGTATAATTTTAATGTGftAAATTAAGTCTTCakCTTTCTGTGa^ 

TTGTACTT AAGT GTGTAACAGGAATATTTTXXS^GAATATAGGTTTAACTGAATGftAGCC^ 

TTTCCTAACTTTGAAAAATTTTGCSiAATGTCTTAGGTGATTTAAATAAATGAGTAOT 

AGTCTGTTTTTAAOlGGTTCTATTACCCAGZ^OTTTTTTGTAAATGCGGCAGTTACAAATTi^ 

TCAGTTTTAAGTTATAAATCa^CCTGAGAATTACCTAATGATGGATTGAATAAATCTTTAGACTACAAAAGCCCAA 

CTTTTCTCTATTTACaTATGCATCTCTCCTATAATGTAAATAGAATAATAGCTTTGAAATACAATTAGGTTTTTG 

AGATTTTTATAACCAAATACATTTCAGTGTAACATATTAGCAGAAAGCATTAGTCTTTGTACTTTGCTTACATTC 

CCaAAAGCTGACaTTTTCACGATTCTTAAAAACACaAAGTTACACTTACTAAAATTAGGACATGTTTTC^ 

T^TGAAGAATATAGTTTAAAAGCTTCCTCCTCCATAGGGACACATTTTCTCTAACCCTTAACTAAAGTGTAGGA 

TTTTAAAATTAAATGTGAGGTAAAATAAGTTTATTTTTAATAGTATCTGTCS^GTTAATATCTGTCAAC^ 

TAATCATGTTATGTTAATTTTAACaTGATTGCTGACTTGGATAATTC^ 

TTGCTAAAATGATCTGGGCCTACCATAAATAAATATCTCXn'TTTCTGAGCTCT^^ 

AGAATTTAGAAf^AACTTGAOaAAACCTAATCXa^AAATAAAATTCACTTAAGTAGAACTAT^^ 

ATCTGACTGGCTO^TCATGAa^TCCTACTCATAACATAAATCAAAGGAGATGATTAATTTCCA^^ 

AAACTTTGGCTGTAGGTTTTTATTTTCmLCaAGAATTCTGGTTTGAATTATTTTTGTAAGCAGGTACATTTT^ 

AAATGTAAGCCCTACTGTAAGGTTTAGCACTGGGTGTACATATTTATTAAAAATTTTTATTATAACAACTTTTAT 

TAAAATGGCCTTTCTGAACACTTTATTTATTGATGTTGAAGTAAGGATTAGAAACa^TAGACTCCC»LAGTTTTA^ 

CACCnSiAATGTGAATAACCCATATATACaACaAAGTTTCTGCCATCTAGCTTTTTGAAGTCTATGGGGGTCTO 

TOUVGTACTAGTAATTTAACTTCykTCATGaATGAACTATAATTTTTAAGTTATGCCCATTTATAACG^ 

GACTACATTGTGAGTTAGAAAO^CTTAAAATTTGGGGTATAGAACCCCTCAACAGGTTAGTAATGCT^^ 

CTTGATGAGCS^TAATGATAACCT^GaGTGATTTCaTTTACACTCaTAGTAGT^^ 

TCTTAGGCCCCTGGGAGAAGAGa«Kn?TAGATTTCCOTACT?GGCAAGGTTTTTAAAAATG^ 

ATGATaVATTACCTTAATTGGa2AAGAAAATGCTTCAGGTGTCTAGGG6TATCCTCTG<^ 

AGGTCaATAAGATCCTTGCCTATGAATACCCCTCCCTTTTGCGCTGTTAAATTTGCAATGAGAAGCA^ 

GTACCATAACTAATAAAGCAGGGTACAGATATAAACTACTGCATCTTTTCTATAAAACTGTGATTAAGAATTOTA 

CCTCTCCTGTATGGCTGTTACTGTACTGTACTCTCTGACTCCTTACCTAACAATGAATTTGTTACATAATCTTCT 

ACATGTATGATTTGTGCCaCTGATCTTAAACCTATGATTa^GTAACTTCTTACCATATAAAAACGATAATTGCTT 

TATTTGGRAAAG2^TTTAGGAATACTAAGGACS\ATTATTTTTATAGACAAAGTAAAAAGACAGATATTTAAGAGG 

CATAACCaAAAAAGOUiAACTTGTAAACAGAGTAAAAATCTTTAATATTTCTAAAGACATACTGTTTATCT^ 

CaTATGCTTTTTTTAATTTCACTATTCG^TTTCTAAATTAAAGTTATGCT 

AACAGCTCATTTTGTCTTTTTCAATATACS^TTTTAAAAATACTACaATATT 

CATAATGTAGCAGTTACCGTGTTCACCTCaCACTAAGGCCrAQAGTTO 

GTTATGCTGTTCTTTCATGTGftATGTCaAGACATGGAGGGTGTTTGTAATTTTATGGTAAAATTAATCCT^^ 

CAa^TAATGGTGTCTTAAAATTGACAAAAAATGAGaVCTTACAATT6TATGTOT(Xn?CA^ 

GTGAAATTTTAAAAGACaTTQATTCCXSCATGTAAGGATTTTTCaTCTGAAGTACAATA^ 

CTCaAACTGCTTTATACTTATAAACAGCCaTCTTAAATAAGaykCGTATTGTGAGTACTGA^^ 

AAATTATCAAAGGAAAA 



FIGURE 181 



></usr/seqdb2/sst/DNA/Dnaseqs.min/ss.DNA52196
><subunit 1 of 1, 229 aa, 1 stop
><MW: 26017, pI: 4.73, NX(S/T): 0

MGDKIWLPFPVLLLAALPPVLLPGAA,GFTPSLDSDFTFTLPAGQKECFYQPMPLKASI.EIEy 
QVLDGAGLDIDFHLASPEGKTLVFEQRKSDGVHTVETEVGDYMFCFDNTFSTISEKVIFFEL 
ILDNMGEQAQEQEDWKKYITGTDILDMKLEDILES INS IKSRLSKSGHIQILLRAFEARDRN 
IQESNFDRVNFWSMVNLWMWVSAIQVYMLKSLFEDKRKSRT 

Important features: 
Signal peptide: 
amino acids 1-23 

Transmembrane domain:

amino acids 195-217 

N-myristoylation site. 

amino acids 43-48 

Tyrosine kinase phosphorylation site. 

amino acids 55-62 



FIGURE 182 



CCATCCCTGAGATCTTTTTATAAAAAACCCAGTCTTTGCTGACCAGACAAAGCATACCAGAT 
CTCACCAGAGAGTCGai.GACACTATGCTGCCTCCCATGGCCCTGCCCAGTGTGTCCTGGATG 
CTGCTTTCCTGCCTCATTCTCCTGTGTCAGGTTCiaAGGTGAAGAAACCCAGAAGGAACTGCC 
CTCTCCACGGATCAGCTGTCCCAAAGGCTCCAAGGCCTATGGCTCCCCCTGCTATGCCTTGT 
TTTTGTCACCAAAATCCTGGATGGATGCAGATCTGGCTTGCCAGAAGCGGCCCTCTGGAAAA 
CTGGTGTCTGTGCTCAGTGGGGCTGAGGGATCCTTCGTGTCCTCCCTGGTGAGGAGCATTAG 
TAACAGCTACTCATACATCTGGATTGGGCTCCATGACCCCACACAGGGCTCTGAGCCTGATG 
GAGATGGATGGGAGTGGAGTAGCACTGATGTGATGAATTACTTTGCATGGGAGAAAAATCCC 
TCCACCATCTTAAACCCTGGCCACTGTGGGAGCCTGTCAAGAAGCACAGGATTTCTGAAGTG 
GAAAGATTATAACTGTGATGCAAAGTTACCCTATGTCTGCAAGTTCAAGGACTAGGGCAGGT 
GGGAAGTCAGCAGCCTCAGCTTGGCGTGCAGCTCATCATGGACATGAGACCAGTGTGAAGAC 
TCACCCTGGAAGAGAATATTCTCCCCAAACTGCCCTACCTGACTACCTTGTCATGATCCTCC 
TTCTTTTTCCTTTTTCTTCACCTTCATTTCAGGCTTTTCTCTGTCTTCCATGTCTTQAGATC 
TCAGAGAATAATAATAAAAATGTTACTTTATAAAAAAAAAAAA2^AAAAAAAAA 



FIGURE 183 
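The nucleotide listing immediately above and the polypeptide listed for DNA56965 are related by ordinary translation of the open reading frame. The following is a minimal sketch in Python, not part of the original disclosure, that translates the first 13 codons of that reading frame using the standard genetic code. The names `CODON_TABLE`, `translate` and `ORF_START` are hypothetical, and the fragment is copied from the second line of the listing starting at its first ATG; choosing that ATG as the start codon is an assumption that happens to reproduce the beginning of the listed polypeptide.

```python
# Standard genetic code, with codons enumerated in T, C, A, G order
# at each of the three positions (64 entries, '*' marks a stop codon).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[pos:pos + 3].upper()]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 13 codons of the reading frame (illustrative fragment only).
ORF_START = "ATGCTGCCTCCCATGGCCCTGCCCAGTGTGTCCTGGATG"
print(translate(ORF_START))  # MLPPMALPSVSWM
```

The printed residues match the first 13 residues of the DNA56965 polypeptide. The full listing is not translated here because several of its lines contain characters that are not valid bases.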

></usr/seqdb2/sst/DNA/Dnaseqs.min/ss.DNA56965
><subunit 1 of 1, 175 aa, 1 stop
><MW: 19330, pI: 7.25, NX(S/T): 1

MLPPMALPSVSWMLLSCLILLCQVQGEETQKELPSPRISCPKGSKAYGSPCYALFLSPKSWM 
DADLACQKRPSGKLVSVLSGAEGSFVSSLVRSISNSYSYIWIGLHDPTQGSEPDGDGWEWSS 
TDVMNYFAWEKNPSTILNPGHCGSLSRSTGFLKWKDYNCDAKLPYVCKFKD 

Important features: 
Signal peptide: 
amino acids 1-26 



C-type lectin domain signature.

amino acids 146-171 



FIGURE 184 
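The header and feature lines of the DNA56965 polypeptide above can be checked against the sequence itself. The sketch below, which is illustrative and not part of the original disclosure, assumes that feature coordinates are 1-based and inclusive and that the `NX(S/T)` header field counts every Asn-X-Ser/Thr tripeptide with X unrestricted. The names `SEQ`, `feature` and `count_sequons` are hypothetical.

```python
import re

# Polypeptide listed for DNA56965, joined from its three lines.
SEQ = (
    "MLPPMALPSVSWMLLSCLILLCQVQGEETQKELPSPRISCPKGSKAYGSPCYALFLSPKSWM"
    "DADLACQKRPSGKLVSVLSGAEGSFVSSLVRSISNSYSYIWIGLHDPTQGSEPDGDGWEWSS"
    "TDVMNYFAWEKNPSTILNPGHCGSLSRSTGFLKWKDYNCDAKLPYVCKFKD"
)


def feature(seq: str, start: int, end: int) -> str:
    """Return residues start..end, treating both bounds as 1-based inclusive."""
    return seq[start - 1:end]


def count_sequons(seq: str, allow_proline: bool = True) -> int:
    """Count N-X-(S/T) tripeptides; optionally require X to be non-proline."""
    middle = "." if allow_proline else "[^P]"
    # Lookahead so that overlapping tripeptides are each counted.
    return len(re.findall(f"N(?={middle}[ST])", seq))


print(len(SEQ))                      # 175, as in the "175 aa" header field
print(feature(SEQ, 1, 26))           # annotated signal peptide
print(feature(SEQ, 146, 171))        # annotated C-type lectin signature
print(count_sequons(SEQ))            # 1, as in the NX(S/T) header field
print(count_sequons(SEQ, False))     # 0 when X may not be proline
```

Under these assumptions the only N-X-(S/T) tripeptide is Asn-Pro-Ser. The widely used N-glycosylation consensus excludes proline at X, which would be consistent with no N-glycosylation site appearing among the listed features.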



CCAGTCTGTCGCC:SVCCTCa.CTTGGTGTCTGCTGTCCCCGCC^GGC^GCCTGGGGTGAGAGC 

ACAGAGGAGTGGGCCGGGACCATGCGGGGGACGCGGCTGGCGCTCCTGGCGCTGGTGCTGGC 

TGCCTGCGGAGAGCTGGCGCCGGCCCTGCGCTGCTACGTCTGTCCGGAGCCCACAGGAGTGT 

CGGACTGTGTCACCATCGCCACCTGCACCACCAACGAAACCATGTGC^GACCACACTCTA 

TCCCGGGAGATAGTGTACCCCTTCCAGGGGGACTCCACGGTGACCAAGTCCTGTGCCAGCAA 

GTGTAAGCCCTCGGATGTGGATGGCATCGGCCAGACCCTGCCCGTGTCCTGCTGCAATACTG 

AGCTGTGCAATGTAGACGGGGCGCCCGCTCTGAACAGCCTCCACTGCGGGGCCCTCACGCTC 

CTCCCACTCTTGAGCCTCCGACTGTAGAGTCCCCGCCCACCCCCATGGCCCTATGCGGCCCA 

GCCCCGAATGCCTTGAAGAAGTGCCCCCTGCACCAGGAA2lAAAAfiAAAAAAAAA 



FIGURE 185

></usr/seqdb2/sst/DNA/Dnaseqs.min/ss.DNA56405
><subunit 1 of 1, 125 aa, 1 stop
><MW: 13115, pI: 5.90, NX(S/T): 1

MRGTRLALLALVIJ^CGEl^PALRCyVCPEPTGVSDCVTIATCTTNETMCKTTLYSREIVyP 
FQGDSTVTKSCASKCKPSDVDGIGQTLPVSCCOTELCtrVDGl^AIiNSI^ 

Important features: 
Signal peptide: 

amino acids 1-17 

N-glycosylation site.
amino acids 46-49 



FIGURE 186 



CTGCAGTCAGGACTCTGGGACCGCAGGGGGCTCCCGGACCCTGACTCTGCAGCCGAACCGGC 
ACGGTTTCGTGGGGACCCAGGCTTGCaAAGTGACGGTCATTTTCTCTTTCTTTCTCCCTCTT 
GAGTCCTTCTGAC^TOATGGCTCTGGGCGCAGCGGGAGCTACCCGGGTCTTTGTCGCGATGG 
TAGCGGCGGCTCTCGGCGGCCACCCTCTGCTGGGAGTGAGCGCCACCTTQAACTCGGTTCTC 
AATTCCAACGCTATCAAGAACCTGCCCCCACCGCTGGGCGGCGCTGCGGGGCACCCAGGCTC 
TGCAGTCAGCGCCGCGCCGGGAATCCTGTACCCGGGCGGGAATAAGTACCAGACCATTGACA 
ACTACCAGCCGTACCCGTGCGCAGAGGACGAGGAGTGCGGCACTGATGAGTACTGCGCTAGT 
CCCACCCGCGGAGGGGACGCAGGCGTGCAAATCTGTCTCGCCTGCAGGAAGCGCCGAAAACG 
CTGCATGCGTCACGCTATGTGCTGCCCCGGGAATTACTGCAAAAATGGAATATGTGTGTCTT 
CTGATCAAAATCATTTCCGAGGAGAAATTGAGGAAACCATCACTGAAAGCTTTGGTAATGAT 
CATAGCACCTTGGATGGGTATTCCAGAAGAACCACCTTGTCTTCAAAAATGTATCACACCAA 
AGGACAAGAAGGTTCTGTTTGTCTCCGGTCATCAGACTGTGCCTCAGGATTGTGTTGTGCTA 
GACACTTCTGGTCCAAGATCTGTAAACCTGTCCTGAAAGAAGGTCAAGTGTGTACCAAGCAT 
AGQAQAAl^GGCTCTCATGGACTAGAAATATTCCAGCGTTGTTACTGTGGAGAAGGTCTGTC 
TTGCCGGATACAGAAAGATCACCATCAAGCCAGTAATTCTTCTAGGCTTCACACTTGTCAGA 
GACACTAAACCAGCTATCCAAATGCAGTGAACTCCTTTTATATAATAGATGCTATGAAAACC 
TTTTATGACCTTCATCAACTCAATCCTAAGGATATACAAGTTCTGTGGTTTCAGTTAAGCAT 
TCCAATAACACCTTCCAAAAACCTGGAGTGTAAGAGCTTTGTTTCTTTATGGAACTCCCCTG 
TGATTGCAGTAAATTACTGTATTGTAAATTCTCAGTGTGGCACTTACCTGTAAATGCAATGA 
AACTTTTAATTATTTTTCTAAAGGTGCTGCACTGCCTATTTTTCCTCTTGTTATGTAAATTT 
TTGTACACATTGATTGTTATCTTGACTGACAAATATTCTATATTGAACTGAAGTAAATCATT 
TCAGCTTATAGTTCTTAAAAGCATAACCCTTTACCCCATTTAATTCTAGAGTCTAGAACGCA 
AGGATCTCTTGGAATGACAAATGATAGGTACCTAAAATGTAACATGAAAATACTAGCTTATT 
TTCTGAAATGTACTATCTTAATGCTTAAATTATATTTCCCTTTAGGCTGTGATAGTTTTTGA 
AATAAAATTTAACATTTAAAAAAAAAAAAA 



FIGURE 187 



></usr/seqdb2/sst/DNA/Dnaseqs.min/ss.DNA57530
><subunit 1 of 1, 266 aa, 1 stop
><MW: 28672, pI: 8.85, NX(S/T): 1

MMALGJ^GATRVFVAMVAAALGGHPLLGVSATIJTSVIiNSNAIKNLPPPLGGAAGHPGSAVSA 
APGILYPGGNKYQTIDNYQPyPCAEDEECGTDEYCASPTRGGDAGVQICLACRKRRKRCMRH 
AMCCPGNYCKNGICVSSDQNHFRGEIEETITESFGNDHSTLDGYSRRTTLSSKMYHTKGQEG 
SVCLRSSDCASGLCCARHFWSKICKPVLKEGQVCTKHRRKGSHGLEIFQRCYCGEGLSCRIQ 
KDHHQASNSSRLHTCQRH 

Important features: 
Signal peptide: 

amino acids 1-23 

N-glycosylation site. 

amino acids 256-259 

Fungal Zn(2)-Cys(6) binuclear cluster domain 

amino acids 110-126 



FIGURE 188 



TGTGTTTCCCTGCAGTCAGAATTTGGGACNGCAGGGGTTCCCGGACCTGATTTTGCAGCGGA 
ACGGGAAGGTTTTGTGGGACCCAGGTTG21AATGACGGTCATTTTTTTTTCTTTCTCCTTCNG 
GAGTCCTTNTGAGANGATGGTTTTGGGCGCAGCGGGAGCTAACCCGGTTTTTTGTNGCGATG 
GTAGCGGCGGTTTTCGGCGGCCACCTTNTGCTGGGAGTGAGCGCCACCTTGAATCGGTTTTC 
AATTCCS^CGNTATC^GAACCTGCCCCCa^CCGNTGGGCGGCGCTGCGGGGCACCCAGGNTT 
TGCAGTCAGCGCCGCGCCGGGAATCCTGTACCCGGGCGGGAATAAGTACCAGACCATTGACA 
ATTACCAGCCGTACCCGTGCGCAGAGGACGAGGAGTGCGGCACTGATGAGTACTGCGCTAGT 
CCCACCCGCGGAGGGGANGCGGGCGTGCAAATNTGTNTNGCCTGCAGGAAGCGCCGAAAACG 
CTGCATGCGTCANGCTATGTGCTGCCCCGGGAATTACTGCAAAAATGGAATATGTGTGTNTT 
CTGATCAAAATCATTTCCGAGGAGAAATTGAGGAAACCATCACTGAAAGCTTTGGTAATGAT 
CATAGCACCTTGGATGGG 



FIGURE 189



GJMK3AACCTACCGGTACCGGCCGa3CGCTGGTAGTCGCCGGTGTGGCraCa\^ 

CTGGGCCGTCGGftGAGTGOSTGTGCTTCrCTCCTGCACXKraGTGCTTGGGCTCGGCCAGGa^^ 

GGGTTTGAGGATGGGGGAGTAGCTACAGGAAGCGACCCCGCGATGGCS^GGTATATT^ 

AGTATTAGAAATGAGCTGAAGACCATTCACAGATTAATATTTTTGGGGACaGATTTGTGATGCTTG^ 

TGAAGTAATGTAGACAGAAGTTCTCAAATTTGCaiTATTACATCAACTGGAACCAGCAGTGaATC^ 

TTAAATCAGAACTTGCATA&GAAAGAGAATGGGAGTCTGGTTAaATAAAGATGACTATATCAGAGACTTGAAAAg 

GATCATTCTCTGTTTTCTGATAGTGTATATGGCCATTTTAGTGGGCACAGATCy^GGATTTTTACAGTTTACTTGG 

AGTGTCCAAAACTGCAAGCAGTAGAGAAATAAGACAAGCTTTCAAGAZ^TTGGCATTGAAGTTACATCCTGATi^ 

AAACCCGAATAACCCaAATGCa^CATGGCGATTTTTTAAAAATAAATAGAGCATATGAAGTACTC^^ 

TCTACGGAAAAAGTATQACa^TATGGaGAA?^GGGACTTGAGGATAATCAAGGTGGCCAGTATGA^ 

CTATTATCGTTATQATTTTGGTATTTATGATGATGATCCTGAAATCATAACyVTTGQAAAGAAGAGAATTTGATGC 

TGCTGTTAATTCTGGAGAACTGTGGTTTGTAAATTTTTACTCCCCaVGGCrGTTCAC^^ 

CAGATGGAGAGACTTTGCTAAAOaAGTGGaTGGGTTACTTCGAATTGGAGCTGTT^ 

GCTTTGCCGAATGAAAGGAGTCAACAGCTATCCCAGTCTCTTCATTTaTCX^ 

TCATGGAGACaGATCAAAGGAGAGTTTAGTGAGTTTTGCaATGa«3CATGTTAGAAGT^ 

GACAGGAAATTTTGTCSiACTCa^TACAAACTGCTTTTGCTGCTGGTATTGGCrGGCTGATCACrTra 

AGGAGGAGATTGTTTGACTTGACaGACACGACTCAGGCTTAGTGGCATGTTGTTTCrC^ 

AGAAATATATTTGGAAGTAATACaTAATCTTCCy^GATTTTGAACTACTTTCGGCAAACACACTAGAGGATCGTTT 

GGCTCATCATCGGTGGCTGTTATTTTTTCATTTTGGAAAAAATGAAAATTCAAATGATCCTGAGCTGAAAAAACT 

AAAAACTCHACTTAAAAATGATCATATTCAAGTTGGCAGGTTTGACTGTTCCTCTGCACCAGAC^ 

TCTGTATGTTTTTCAGCCGTCTCTAGCAGTATTTAAAGGACaAGGAACCAAAQAATATGAAATTCATCA 

GAafiATTCTATATGATATACTTGCCTTTGCCAAAGAAAGTGTGAATTCTCATGTTACCa^ 

TTTTCCTGCa^TGACaAAGAACCATGGCriTGTTGATTTCTTT^ 

ACaWSAGTTACGAAGAGO^TOWiATCTTCTTTATGGTCaGC^ 

GGGACTCTGTAAOVTGTATAACATTa^GGCTTATCCAACS^CAGTGGTATTC^ 

TGAAGQACATCavCrCTGCTGAACAAATCTTGGAGTTCATAGAGGATCTTATGAATCCrTCAGT^ 

ACCCaCCa^CCTTCAACGAACTAGTTACACAAAGAAAACACaACGAAGTCTGGATGGTTGATTTCTATTCTCaS 

GTGTGa,TCCTTGCC:ffiAGTCrTAATGCCai.GAATGGAAAAGAATGGCCa3GAaVTTAACTGGACTGATa^ 

Ca.GTATAGATTGCCAACaGTATGATTCTTTTTGTGCCCAG6AAAACGTTaiAAGATACCCTGAGATAAG^ 

TCCCCCaAAATCAAATAAAGCTTATa^GTATCACAGTTACAATGGTTGGAATAGGGATGC 

CTGGGGTCTAGGATTTTTACCrCaAGTATCCACAGATCTAACACCT 

GAAAAATCaTTGGGTGATTGATTTCTATGCTCCTTGGTGTGGACCTTGCCa.GAATTTTGCTCCAGAATTTGAGCT 

CTTGGCTAGGATGATTAAAGGAaaAGTGAaAGCTGGftAAAGTAGACTGTCAGGCTTATGCTCA 

AGCTGGGATCAGGGCCTATCa^CTGTTAAGTTTTATTTCTAOSaAAGAGC^^ 

GATAAATACCAGAGATGCAAAAGa^TCGCTGCCTTAATAAGTGAAAAATTGQaAACTCT 

QAGGAATAAGGATGAACTTTGATAATGTTGAAGATGAAGAAAAAGTTTAAAAGZ^AATTCT 

AAGACACCTATTTAGAATGTTACATTTATGATGGGAATGAATGAACATTATCTTAGACTTGCAGTTGT^ 

GAATTATCTACaGCACTGGTGTAAAAGJUlGGGTCTCCaAACTTTTTCTGTAAAGGGCCGGTTTATAAATATTTT^ 

GaCTTTGO^GGCTATAATATATGGTTO^CaCATGAGAACaAGAATAGAGTa^TCaTGTATTCT 

TTTAACSACCTTTAAAAAATATTAAAACGATTCTTAGCTCAGAGCCATACAAAAGTAGGCTGQATTCAGTCCATO 

GACCS^TAGATTGCTGTCCCCCTCGACGGACTTATAATGTTTCAGGTGGCTGGCTTGAACATGAGTCTGCTGTGCT 

ATCT AC^TA AATGTCTAAGTTGTATAAAGTCCACTTTCCCTTCACGTTTTTTGGCTG^ 

TAGTTTTTGGTCACTTGTTCTCCTAAAAATGCTATCCCTAACCATATATTTATATTTCGTT 

GATGTGGCACAGTAAAaiAACCCTGTTATGCTGTATTATTATGAGGaGATTCTT^ 

AAGGTTGAAAAAATGCTTTTAATTTTTGAa^CGAGAAAC^VGTGCAGCaGTATATGTGC^ 

AAATTTGAGCS^CaGTAAGTGCAC3W^TTCTGTAGTTTGCTGTATCATCCA^ 

TAGCaATTAACTGGGCATTGTAGAGTATCCTAAATATGTTATCAAGTATTTAGAGTTCTATATTTTAAAGATATA 

TGTGTTCaVTGTATTTTCTGAAATTGCTTTCATAGaAATTTTCCCACTGATAGTTGATTTTTGAGGCATCTA^ 

TTACATATTTGCCTTCTGAACTTTGTTTTGACCTGTATCCTTTATTTACATTGGGTTTTTCTTTCATAGTTTTG^ 

TTTTTCACTCCTGTCCAGTCTATTTATTATTCAAATAGGAAAAATTACTTTACAGGTTGTTTTACTGTAGCTTAT 

AATGATACTGTAGTTATTCCyiGTTACTAGTTTACTGTCAGAGGGCTGCCTTTTTCAGATAAATATTGACAT^ 

ACTGAAGTTATTTTTATAAGAAAATCaAGTATATAAATCTAGGAAAGGGATCTTCTAGTTTCTGTGTTGOT 

CTCaAAGAATCaO^TTTGTOWSTAACATGTAGTTGTTTAGTTATAAOT 

CCSiATaVGTauU^AGAGGTCSiATGAATTAAAAGGCTTGCaACTTT^ 



FIGURE 190 



</usr/seqdb2/sst/DNA/Dnaseqs.min/ss.DNA56439
<subunit 1 of 1, 747 aa, 1 stop
<MW: 86127, pI: 7.46, NX(S/T): 2

MGVWLNKDDYIRDLKRIILCFLIVYMAILVGTDQDFYSLLGVSKTASSREIRQAFKKLALKL

HPDKNPNNPNAHGDFLKINRAYEVLKDEDLRKKYDKYGEKGLEDNQGGQYESWNYYRYDFGI

YDDDPEIITLERREFDAAVNSGELWFVNFYSPGCSHCHDLAPTWKDFAKEVDGLLRIGAVNC 

GDDRMLCRMKGVNSYPSLFIFRSGMAFVKYHGDRSKESLVSFi^QHWSTVTELWTGNFW 

IQTAFAAGIGWLITFCSKGGDCLTSQTRLRLSGMLFLNSLDAKEIYLEVIHNLPDFELLSAN 

TLEDRLAHHRWLLFFHFGKNENSNDPELKKLKTLLKNDHIQVGRFDCSSAPDICSNLYVFQP

SLAVFKGQGTKEYEIHHGKKILYDILAFAKESVNSHVTTLGPQNFPANDKEPWLVDFFAPWC 

PPCRALLPELRRASNLLYGQLKFGTLDCTVHEGLCNMYNIQAYPTTVVFNQSNIHEYEGHHS

AEQILEFIEDLmPSWSLTPTTFNELVTQRKHNEVTimVDFySPWCHPCQVLMPEWKRr^ 

LTGLINVGSIDCQQYHSFCAQEKVQRYPEIRFFPPKSNKAYQYHSYNGWNRDAYSLRIWGLG 

FLPQVSTDLTPQTFSEKVLQGKNHWVIDFYAPWCGPCQNFAPEFELLARMIKGKVKAGKVDC

QAYAQTCQKAGIRAYPTVKEYFYERAKE?NFQEEQIimiDAKAIAALIS^ 

Important features: 

Endoplasmic reticulum targeting sequence.
amino acids 744-747 

Cytochrome c family heme-binding site signature. 

amino acids 158-163 

Nt-dnaJ domain signature.

amino acids 77-96 

N-glycosylation site. 

amino acids 484-487 



FIGURE 191 



AGACAGTACCTCCTCCCTAGGACTACACAAGGACTGAACCAGAAGGAAGAGGACAGAGCAAA 

GCCATGAACATCATCCTAGAAATCCTTCTGCTTCTGATCACCATCATCTACT

GTCGTTGGTGAAGTTTTTCATTCCTCAGAGGAGAAAATCTGTGGCTGGGGAGATTGTTCTCA 

TTACTGGAGCTGGGCATGGAATAGGCAGGCAGACTACTTATGAATTTGCAAAACGACAGAGC 

ATATTGGTTCTGTGGGATATTAATAAGCGCGGTGTGGAGGAAACTGCAGCTGAGTGCCGAAA 

ACTAGGCGTCACTGCGCATGCGTATGTGGTAGACTGCAGCAACAGAGAAGAGATCTATCGCT 

CTCTAAATCAGGTGAAGAAAGAAGTGGGTGATGTAACAATCGTGGTGAATAATGCTGGGACA 

GTATATCCAGCCGATCTTCTCAGCACCAAGGATGAAGAGATTACCAAGACATTTGAGGTCAA 

CATCCTAGGACATTTTTGGATCACAAAAGCACTTCTTCCATCGATGATGGAGAGAAATCATG 

GCCACATCGTCACAGTGGCTTCAGTGTGCGGCCACGAAGGGATTCCTTACCTCATCCCATAT 

TGTTCCAGCAAATTTGCCGCTGTTGGCTTTCACAGAGGTCTGACATCAGAACTTCAGGCCTT 

GGGAAAAACTGGTATCAAAACCTCATGTCTCTGCCCAGTTTTTGTGAATACTGGGTTCACCA

AAAATCCAAGCACAAGATTATGGCCTGTATTGGAGACAGATGAAGTCGTAAGAAGTCTGATA 

GATGGAATACTTACCAATAAGAAAATGATTTTTGTTCCATCGTATATCAATATCTTTCTGAG 

ACTACAGAAGTTTCTTCCTGAACGCGCCTCAGCGATTTTAAATCGTATGCAGAATATTCAAT 

TTGAAGCAGTGGTTGGCCACAAAATCAAAATGAAATGAATAAATAAGCTCCAGCCAGAGATG 

TATGCATGATAATGATATGAATAGTTTCGAATCAATGCTGCAAAGCTTTATTTCACATTTTT 

TCAGTCCTGATAATATTAAAAACATTGGTTTGGCACTAGCAGCAGTCAAACGAACAAGATTA 

ATTACCTGTCTTCCTGTTTCTCAAGAATATTTACGTAGTTTTTCATAGGTCTGTTTTTCCTT 

TCATGCCTCTTAAAAACTTCTGTGCTTACATAAACATACTTAAAAGGTTTTCTTTAAGATAT 

TTTATTTTTCCATTTAAAGGTGGACAAAAGCTACCTCCCTAAAAGTAAATACAAAGAGAACT 

TATTTACACAGGGAAGGTTTAAGACTGTTCAAGTAGCATTCCAATCTGTAGCCATGCCACAG 

AATATCAACAAGAACACAGAATGAGTGCACAGCTAAGAGATCAAGTTTCAGCAGGCAGCTTT 

ATCTCAACCTGGAC^TATTTTAAGATTCAGCATTTGAAAGATTTCCCTAGCCTCTTCCTTTT 

TCATTAGCCCAAAACGGTGCAACTCTATTCTGGACTTTATTACTTGATTCTGTCTTCTGTAT 

AACTCTGAAGTCCACCAAAAGTGGACCCTCTATATTTCCTCCCTTTTTATAGTCTTATAAGA 

TACATTATGAAAGGTGACCGACTCTATTTTAAATCTCAGAATTTTAAGTTCTAGCCCCATGA 

TAACCTTTTTCTTTGTAATTTATGCTTTCATATATCCTTGGTCCCAGAGATGTTTAGACAAT 

TTTAGGCTCAAAAATTAAAGCTAACACAGGAAAAGGAACTGTACTGGCTATTACATAAGAAA 

CAATGGACCCAAGAGAAGAA 



FIGURE 192

</usr/seqdb2/sst/DNA/Dnaseqs.min/ss.DNA56409
<subunit 1 of 1, 300 aa, 1 stop
<MW: 33655, pI: 9.31, NX(S/T): 1

MNIILEILLLLITIIYSYLESLVKFFIPQRRKSVAGEIVLITGAGHGIGRQTTYEFAKRQSI
LVLWDINKRGVEETAAECRKLGVTAHAYVVDCSNREEIYRSLNQVKKEVGDVTIVVNNAGTV
YPADLLSTKDEEITKTFEVNILGHFWITKALLPSMMERNHGHIVTVASVCGHEGIPYLIPYC
SSKFAAVGFHRGLTSELQALGKTGIKTSCLCPVFVNTGFTKNPSTRLWPVLETDEVVRSLID
GILTNKKMIFVPSYINIFLRLQKFLPERASAILNRMQNIQFEAVVGHKIKMK

Important features: 
Signal peptide: 
amino acids 1-19 

cAMP- and cGMP-dependent protein kinase phosphorylation site.

amino acids 30-33 and 58-61 

Short-chain alcohol dehydrogenase family protein
amino acids 165-202, 37-49, 112-122 and 210-219 



FIGURE 193 



CGGCGGCGGCTGCGGGCGCGAGGTGAGGGGCGCGAGGTGAGGGGCGCGAGGTTCCCAGCAGG 
ATGCCCCGGCTCTGCAGGAAGCTGAAGTGAGAGGCCCGGAGAGGGCCCAGCCCGCCCGGGGC 
AGGATGACCAAGGCCCGGCTGTTCCGGCTGTGGCTGGTGCTGGGGTCGGTGTTCATGATCCT 
GCTGATCATCGTGTACTGGGACAGCGCAGGCGCCGCGCACTTCTACTTGCACACGTCCTTCT 
CTAGGCCGCACACGGGGCCGCCGCTGCCCACGCCCGGGCCGGACAGGGACAGGGAGCTCACG 
GCCGACTCCGATGTCGACGAGTTTCTGGACAAGTTTCTCAGTGCTGGCGTGAAGCAGAGCGA
CCTTCCCAGAAAGGAGACGGAGCAGCCGCCTGCGCCGGGGAGCATGGAGGAGAGCGTGAGAG 
GCTACGACTGGTCCCCGCGCGACGCCCGGCGCAGCCCAGACCAGGGCCGGCAGCAGGCGGAG 
CGGAGGAGCGTGCTGCGGGGCTTCTGCGCCAACTCCAGCCTGGCCTTCCCCACCAAGGAGCG 
CGCATTCGACGACATCCCCAACTCGGAGCTGAGCCACCTGATCGTGGACGACCGGCACGGGG 
CCATCTACTGCTACGTGCCCAAGGTGGCCTGCACCAACTGGAAGCGCGTGATGATCGTGCTG 
AGCGGAAGCCTGCTGCACCGCGGTGCGCCCTACCGCGACCCGCTGCGCATCCCGCGCGAGCA 
CGTGCACAACGCCAGCGCGCACCTGACCTTCAACAAGTTCTGGCGCCGCTACGGGAAGCTCT 
CCCGCCACCTCATGAAGGTCAAGCTCAAGAAGTACACCAAGTTCCTCTTCGTGCGCGACCCC 
TTCGTGCGCCTGATCTCCGCCTTCCGCAGCAAGTTCGAGCTGGAGAACGAGGAGTTCTACCG 
CAAGTTCGCCGTGCCCATGCTGCGGCTGTACGCCAACCACACCAGCCTGCCCGCCTCGGCGC 
GCGAGGCCTTCCGCGCTGGCCTCAAGGTGTCCTTCGCCAACTTCATCCAGTACCTGCTGGAC 
CCGCACACGGAGAAGCTGGCGCCCTTCAACGAGCACTGGCGGCAGGTGTACCGCCTCTGCCA 
CCCGTGCCAGATCGACTACGACTTCGTGGGGAAGCTGGAGACTCTGGACGAGGACGCCGCGC 
AGCTGCTGCAGCTACTCCAGGTGGACCGGCAGCTCCGCTTCCCCCCGAGCTACCGGAACAGG 
ACCGCCAGCAGCTGGGAGGAGGACTGGTTCGCCAAGATCCCCCTGGCCTGGAGGCAGCAGCT 
GTATAAACTCTACGAGGCCGACTTTGTTCTCTTCGGCTACCCCAAGCCCGAAAACCTCCTCC 
GAGACTGAAAGCTTTCGCGTTGCTTTTTCTCGCGTGCCTGGAACCTGACGCACGCGCACTCC 
AGTTTTTTTATGACCTACGATTTTGCAATCTGGGCTTCTTGTTCACTCCACTGCCTCTATCC 
ATTGAGTACTGTATCGATATTGTTTTTTAAGATTAATATATTTCAGGTATTTAATACGA



FIGURE 194 

</usr/seqdb2/sst/DNA/Dnaseqs.min/ss.DNA56112
<subunit 1 of 1, 414 aa, 1 stop
<MW: 48414, pI: 9.54, NX(S/T): 4

MTKARLFRLWLVLGSVFMILLIIVYWDSAGAAHFYLHTSFSRPHTGPPLPTPGPDRDRELTA 
DSDVDEFLDKFLSAGVKQSDLPRKETEQPPAPGSMEESVRGYDWSPRDARRSPDQGRQQAER 
RSVLRGFCANSSLAFPTKERAFDDIPNSELSHLIVDDRHGAIYCYVPKVACTNWKRVMIVLS
GSLLHRGAPYRDPLRIPREHVHNASAHLTFNKFWRRYGKLSRHLMKVKLKKYTKFLFVRDPF
VRLISAFRSKFELENEEFYRKFAVPMLRLYANHTSLPASAREAFRAGLKVSFANFIQYLLDP 
HTEKLAPFNEHWRQVYRLCHPCQIDYDFVGKLETLDEDAAQLLQLLQVDRQLRFPPSYRNRT 
ASSWEEDWFAKIPLAWRQQLYKLYEADFVLFGYPKPENLLRD 

Important features: 
Signal peptide: 

amino acids 1-31 

N-glycosylation sites. 

amino acids 134-137, 209-212, 280-283 and 370-373 

TNFR/NGFR family cysteine-rich region protein 

amino acids 329-332 



FIGURE 195 

TCGGGCCAGAATTCGGCACGAGGCGGCACGAGGGCGACGGCCTCACGGGGCTTTGGAGGTGA
AAGAGGCCCAGAGTAGAGAGAGAGAGAGACCGACGTACACGGGATGGCTACGGGAACGCGCT
ATGCCGGGAAGGTGGTGGTCGTGACCGGGGGCGGGCGCGGCATCGGAGCTGGGATCGTGCGC 
GCCTTCGTGAACAGCGGGGCCCGAGTGGTTATCTGCGACAAGGATGAGTCTGGGGGCCGGGC 
CCTGGAGCAGGAGCTCCCTGGAGCTGTCTTTATCCTCTGTGATGTGACTCAGGAAGATGATG 
TGAAGACCCTGGTTTCTGAGACCATCCGCCGATTTGGCCGCCTGGATTGTGTTGTCAACAAC 
GCTGGCCACCACCCACCCCCACAGAGGCCTGAGGAGACCTCTGCCCAGGGATTCCGCCAGCT 
GCTGGAGCTGAACCTACTGGGGACGTACACCTTGACCAAGCTCGCCCTCCCCTACCTGCGGA 
AGAGTCAAGGGAATGTCATCAACATCTCCAGCCTGGTGGGGGCAATCGGCCAGGCCCAGGCA 

GTTCCCTATGTGGCCACCAAGGGGGCAGTAACAGCCATGACCAAAGCTTTGGCCCTGGATGA
AAGTCCATATGGTGTCCGAGTCAACTGTATCTCCCCAGGAAACATCTGGACCCCGCTGTGGG
AGGAGCTGGCAGCCTTAATGCCAGACCCTT^GGCCACAATCCGAGAGGGCATGCTGGCCC^
CCACTGGGCCGCATGGGCCAGCCCGCTGAGGTCGGGGCTGCGGCAGTGTTCCTGGCCT
AGCCAACTTCTGCACGGGCATTGAACTGCTCGTGACGGGGGGTGCAGAGCTGGGGTACGGGT
GCAAGGCCAGTCGGAGCACCCCCGTGGACGCCCCCGATATCCCTTCCTGATTTCTCTCATTT
CTACTTGGGGCCCCCTTCCTAGGACTCTCCCACCCCAAACTCCAACCTGTATCAGATGCAGC
CCCCAAGCCCTTAGACTCTAAGCCCAGTTAGCAAGGTGCCGGGTCACCCTGCAGGTTC
AAAAACGATTTGCAGCC



FIGURE 196 

</usr/seqdb2/sst/DNA/Dnaseqs.min/ss.DNA56045
<subunit 1 of 1, 270 aa, 1 stop
<MW: 28317, pI: 6.00, NX(S/T): 1

MATGTRYAGKVVVVTGGGRGIGAGIVRAFVNSGARVVICDKDESGGRALEQELPGAVFILCD
VTQEDDVKTLVSETIRRFGRLDCVVNNAGHHPPPQRPEETSAQGFRQLLELNLLGTYTLTKL
ALPYLRKSQGNVINISSLVGAIGQAQAVPYVATKGAVTAMTKALALDESPYGVRVNCISPGN 
IWTPLWEELAALMPDPRATIREGMLAQPLGRMGQPAEVGAAAVFLASEANFCTGIELLVTGG 
AELGYGCKASRSTPVDAPDIPS 

Important features: 
N-glycosylation site. 

amino acids 138-141 

Short-chain alcohol dehydrogenase family protein
amino acids 10-22, 81-91, 134-171 and 176-185 



FIGURE 197 



AGGCGGGCAGCAGCTGCAGGCTGACCTTGCAGCTTGGCGGAaTOGACT 

CTGTTTCTTCTTACCATTTCCATCTTCCTGGGGCTGGGCaVGCCCAGGAGCCCC^^ 

GAGGAAGGGGCAAGGGCGGCCTGGGCCCCTGGCCCCTGGCCCTCACCAGGTGCCACTGGACC 

TGGTGTCACGGATGAAACCGTATGCCCGCATGGAGGAGTATGAGAGGAACATCGAGGAGATG 

GTGGCCCAGCTGAGGAACAGCTCAGAGCTGGCCCAGAGAAAGTGTGAGGTCAACTTGCAGCT 

GTGGATGTCCAACAAGAGGAGCCTGTCTCCCTGGGGCTACAGCATCAACCACGACCCCAGCC 

GTATCCCCGTGGACCTGCCGGAGGCACGGTGCCTGTGTCTGGGCTGTGTGAACCCCTTCACC 

ATGCAGGAGGACCGCAGCATGGTGAGCGTGCCGGTGTTCAGCCAGGTTCCTGTGCGCCGCCG 

CCTCTGCCCGCCACCGCCCCGCACAGGGCCTTGCCGCCAGCGCGCAGTCATGGAGACCATCG 

CTGTGGGCTGCACCTGCATCTTCTGAATCACCTGGCCCAGAAGCCAGGCCAGCAGCCCGAGA 

CCATCCTCCTTGCACCTTTGTGCCAAGAAAGGCCTATGAAAAGTAAACACTGACTTTTGAAA 

GCAAG 



FIGURE 198 

</usr/seqdb2/sst/DNA/Dnaseqs.min/ss.DNA59294
<subunit 1 of 1, 180 aa, 1 stop
<MW: 20437, pI: 9.58, NX(S/T): 1

MDWPHNLLFLLTISIFLGLGQPRSPKSKRKGQGRPGPLAPGPHQVPLDLVSRMKPYARMEEY 
ERNIEEMVAQLRNSSELAQRKCEVNLQLWMSNKRSLSPWGYSINHDPSRIPVDLPEARCLCL
GCVNPFTMQEDRSMVSVPVFSQVPVRRRLCPPPPRTGPCRQRAVMETIAVGCTCIF 

Important features: 
Signal peptide: 
amino acids 1-20 

N-glycosylation site. 

amino acids 75-78 

Homologous region to IL-17
amino acids 96-180. 
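
The NX(S/T) count in the header of this figure and the N-glycosylation site listed above can be cross-checked mechanically. The following is a minimal Python sketch and is not part of the original disclosure. It assumes the PROSITE-style sequon pattern N-{P}-[ST]-{P}, whereas the source names the motif only as NX(S/T). The function name n_glycosylation_sites is hypothetical. The sequence is the FIGURE 198 sequence with OCR spacing removed and residue 70 read as V, as encoded by the DNA of FIGURE 197.

```python
import re

# FIGURE 198 amino acid sequence (180 aa), OCR spacing removed.
# Assumption: residue 70 is V, following the DNA sequence of FIGURE 197.
SEQ = (
    "MDWPHNLLFLLTISIFLGLGQPRSPKSKRKGQGRPGPLAPGPHQVPLDLVSRMKPYARMEEY"
    "ERNIEEMVAQLRNSSELAQRKCEVNLQLWMSNKRSLSPWGYSINHDPSRIPVDLPEARCLCL"
    "GCVNPFTMQEDRSMVSVPVFSQVPVRRRLCPPPPRTGPCRQRAVMETIAVGCTCIF"
)

# Assumed sequon definition: N, any residue except P, S or T, any residue
# except P. The lookahead lets overlapping candidate sites be reported.
SEQUON = re.compile(r"(?=N[^P][ST][^P])")


def n_glycosylation_sites(seq):
    """Return 1-based (start, end) residue ranges of candidate sequons."""
    return [(m.start() + 1, m.start() + 4) for m in SEQUON.finditer(seq)]


if __name__ == "__main__":
    print(len(SEQ), n_glycosylation_sites(SEQ))
```

Under these assumptions the scan of the 180-residue sequence reports a single candidate site at residues 75-78, which agrees with the header count and the feature listed above.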



FIGURE 199 



GCGCCGCCAGGCGTAGGCGGGGTGGCCCTTGCGTCTCCCGCTTCCTTGAAAAACCCGGCGGG 
CGAGCGAGGCTGCGGGCCGGCCGCTGCCCTTCCCCACACTCCCCGCCGAGAAGCCTCGCTCG 
GCGCCCAACATGGCGGGTGGGCGCTGCGGCCCGCAGCTAACGGCGCTCCTGGCCGCCTGGAT 
CGCGGCTGTGGCGGCGACGGCAGGCCCCGAGGAGGCCGCGCTGCCGCCGGAGCAGAGCCGGG 
TCCAGCCCATGACCGCCTCCAACTGGACGCTGGTGATGGAGGGCGAGTGGATGCTGAAATTT 
TACGCCCCATGGTGTCCATCCTGCCAGCAGACTGATTCAGAATGGGAGGCTTTTGCAAAGAA
TGGTGAAATACTTCAGATCAGTGTGGGGAAGGTAGATGTCATTCAAGAACCAGGTTTGAGTG
GCCGCTTCTTTGTCACCACTCTCCCAGCATTTTTTCATGCAAAGGATGGGATATTCCGCCGT 
TATCGTGGCCCAGGAATCTTCGAAGACCTGCAGAATTATATCTTAGAGAAGAAATGGCAATC 
AGTCGAGCCTCTGACTGGCTGGAAATCCCCAGCTTCTCTAACGATGTCTGGAATGGCTGGTC 
TTTTTAGCATCTCTGGCAAGATATGGCATCTTCACAACTATTTCACAGTGACTCTTGGAATT 
CCTGCTTGGTGTTCTTATGTGTTTTTCGTCATAGCCACCTTGGTTTTTGGCCTTTTTATGGG 
TCTGGTCTTGGTGGTAATATCAGAATGTTTCTATGTGCCACTTCCAAGGCATTTATCTGAGC 
GTTCTGAGCAGAATCGGAGATCAGAGGAGGCTCATAGAGCTGAACAGTTGCAGGATGCGGAG 
GAGGAAAAAGATGATTCAAATGAAGAAGAAAACAAAGACAGCCTTGTAGATGATGAAGAAGA 
GAAAGAAGATCTTGGCGATGAGGATGAAGCAGAGGAAGAAGAGGAGGAGGACAACTTGGCTG 
CTGGTGTGGATGAGGAGAGAAGTGAGGCCAATGATCAGGGGCCCCCAGGAGAGGACGGTGTG 
ACCCGGGAGGAAGTAGAGCCTGAGGAGGCTGAAGAAGGCATCTCTGAGCAACCCTGCCCAGC 
TGACACAGAGGTGGTGGAAGACTCCTTGAGGCAGCGTAAAAGTCAGCATGCTGACAAGGGAC 
TGTAGATTTAATGATGCGTTTTCAAGAATACACACCAAAACAATATGTCAGCTTCCCTTTGG 
CCTGCAGTTTGTACCAAATCCTTAATTTTTCCTGAATGAGCAAGCTTCTCTTAAAAGATGCT 
CTCTAGTCATTTGGTCTCATGGCAGTAAGCCTCATGTATACTAAGGAGAGTCTTCCAGGTGT 
GACAATCAGGATATAGAAAAACAAACGTAGTGTTGGGATCTGTTTGGAGACTGGGATGGGAA 
CAAGTTCATTTACTTAGGGGTCAGAGAGTCTCGACCAGAGGAGGCCATTCCCAGTCCTAATC 
AGCACCTTCCAGAGACAAGGCTGCAGGCCCTGTGAAATGAAAGCCAAGCAGGAGCCTTGGCT 
CCTGAGCATCCCCAAAGTGTAACGTAGAAGCCTTGCATCCTTTTCTTGTGTAAAGTATTTAT
TTTTGTC?U^TTGCAGGAAACATCAGGCACCACAGTGCATGAAAAATCTTTCACAGCTAGAA 
ATTGAAAGGGCCTTGGGTATAGAGAGCAGCTCAGAAGTCT^TCCCAGCCCTCTGAATCrCCTG 
TGCTATGTTTTATTTCTTACCTTTAATTTTTCCAGCATTTCCACCATGGGCATTCAGGCTCT 
CCACACTCTTCACTATTATCTCTTGGTCAGAGGACTCCAATAACAGCCAGGTTTACATGAAC 
TGTGTTTGTTCATTCTGACCTAAGGGGTTTAGATAATCAGTAACCATAACCCCTGAAGCTGT 
GACTGCCAAACATCTCAAATGAAATGTTGTGGCCATCAGAGACTCAAAAGGAAGTAAGGATT 

AAGTTTTCTAAGCAATATTTTTCAAGCCAGAAGTCCTCTAAGTCTTGCCAGTACAAGGTAGT 
CTTGTGAAGAAAAGTTGAATACTGTTTTGTTTTCATCTCAAGGGGTTCCCTGGGTCTTGAAC 
TACTTTAATAATAACTAAAAAACCACTTCTGATTTTCCTTCAGTGATGTGCTTTTGGTGAAA 
GAATTAATGAACTCCAGTACCTGAAAGTGAAAGATTTGATTTTGTTTCCATCTTCTGTAATC
TTCCAAAGAATTATATCTTTGTAAATCTCTCAATACTCAATCTACTGTAAGTACCCAGGGAG 
GCTAATTTCTTT 



FIGURE 200 

</usr/seqdb2/sst/DNA/Dnaseqs.min/ss.DNA56433
<subunit 1 of 1, 349 aa, 1 stop
<MW: 38952, pI: 4.34, NX(S/T): 1

MAGGRCGPQLTALLAAWIAAVAATAGPEEAALPPEQSRVQPMTASNWTLVMEGEWMLKFYAP 
WCPSCQQTDSEWEAFAKNGEILQISVGKVDVIQEPGLSGRFFVTTLPAFFHAKDGIFRRYRG 
PGIFEDLQNYILEKKWQSVEPLTGWKSPASLTMSGMAGLFSISGKIWHLHNYFTVTLGIPAW 
CSYVFFVIATLVFGLFMGLVLVVISECFYVPLPRHLSERSEQNRRSEEAHRAEQLQDAEEEK
DDSNEEENKDSLVDDEEEKEDLGDEDEAEEEEEEDNLAAGVDEERSEANDQGPPGEDGVTRE
EVEPEEAEEGISEQPCPADTEVVEDSLRQRKSQHADKGL

Important features: 
Signal peptide: 

amino acids 1-22 

Transmembrane domain: 

amino acids 191-211 

N-glycosylation site. 

amino acids 46-49 

Thioredoxin family proteins (homologous region to disulfide isomerase)

amino acids 56-72 

Flavodoxin proteins

amino acids 173-187 



FIGURE 201



ATCTGGTTGAACTACTTAAGCTTAATTTGTTAAACTCCGGTAAGTACCTAGCCCACATGATT 
TGACTCAGAGATTCTCTTTTGTCCACAGACAGTCATCTCAGGGGCAGAAAGAAAAGAGCTCC 
CAAATGCTATATCTATTCAGGGGCTCTCAAGAACAATGGAATATCATCCTGATTTAGAAAAT 
TTGGATGAAGATGGATATACTCAATTACACTTCGACTCTCAAAGCAATACCAGGATAGCTGT 
TGTTTCAGAGAAAGGATCGTGTGCTGCATCTCCTCCTTGGCGCCTCATTGCTGTAATTTTGG 
GAATCCTATGCTTGGTAATACTGGTGATAGCTGTGGTCCTGGGTACCATGGGGGTTCTTTCC 
AGCCCTTGTCCTCCTAATTGGATTATATATGAGAAGAGCTGTTATCTATTCAGCATGTCACT 
AAATTCCTGGGATGGAAGTAAAAGACAATGCTGGCAACTGGGCTCTAATCTCCTAAAGATAG 
ACAGCTCAAATGAATTGGGATTTATAGTAAAACAAGTGTCTTCCCAACCTGATAATTCATTT 
TGGATAGGCCTTTCTCGGCCCCAGACTGAGGTACCATGGCTCTGGGAGGATGGATCAACATT 
CTCTTCTAACTTATTTCAGATCAGAACCACAGCTACCCAAGAAAACCCATCTCCAAATTGTG 
TATGGATTCACGTGTCAGTCATTTATGACCAACTGTGTAGTGTGCCCTCATATAGTATTTGT 
GAGAAGAAGTTTTCAATGTAAGAGGAAGGGTGGAGAAGGAGAGAGAAATATGTGAGGTAGTA 
AGGAGGACAGAAAACAGAACAGAAAAGAGTAACAGCTGAGGTCAAGATAAATGCAGAAAATG
TTTAGAGAGCTTGGCCAACTGTAATCTTAACCAAGAAATTGAAGGGAGAGGCTGTGATTTCT 
GTATTTGTCGACCTACAGGTAGGCTAGTATTATTTTTCTAGTTAGTAGATCCCTAGACATGG 
AATCAGGGCAGCCAAGCTTGAGTTTTTATTTTTTATTTATTTATTTTTTTGAGATAGGGTCT 
CACTTTGTTACCCAGGCTGGAGTGCAGTGGCACAATCTCGACTCACTGCAGCTATCTCTCGC 
CTCAGCCCCTCAAGTAGCTGGGACTACAGGTGCATGCCACCATGCCAGGCTAATTTTTGGTG 
TTTTTTGTAGAGACTGGGTTTTGCCATGTTGACCAAGCTGGTCTCTAACTCCTGGGCTTAAG 
TGATCTGCCCGCCTTGGCCTCCCAAAGTGCTGGGATTACAGATGTGAGCCACCACACCTGGC 
CCCAAGCTTGAATTTTCATTCTGCCATTGACTTGGCATTTACCTTGGGTAAGCCATAAGCGA 
ATCTTAATTTCTGGCTCTATCAGAGTTGTTTCATGCTCAACAATGCCATTGAAGTGCACGGT 
GTGTTGCCACGATTTGACCCTCAACTTCTAGCAGTATATCAGTTATGAACTGAGGGTGAAAT 
ATATTTCTGAATAGCTAAATGAAGAAATGGGAAAAAATCTTCACCACAGTCAGAGCAATTTT
ATTATTTTCATCAGTATGATCATAATTATGATTATCATCTTAGTAAAAAGCAGGAACTCCTA
CTTTTTCTTTATCAATTAAATAGCTCAGAGAGTACATCTGCCATATCTCTAATAGAATCTTT 

CACGATCTCGGCTCACCGCAACCTCCGCCCCCTGGGTTCAAGCAATTCTCCTGCCTCAGCCT 
CCCAAGTAGCTGGGATTACAGTC^GGCACCACCACACCCGGCTAATTTTGTATTTTTTTAGT 
AGAGACAGGGTTTCTCCATGTCGGTCAGGGTAGTCCCGAACTCCTGACCTCAAGTGATCTGC 
CTGCCTCGGCCTCCCAAGTGCTGGGATTACAGGCGTGAGCCACTGCACCCAGCCTAGAATCT 
TGTATAATATGTAATTGTAGGGAAACTGCTCTCATAGGAAAGTTTTCTGCTTTTTAAATACA 
AAAATACATAAAAATACATAAAATCTGATGATGAATATAAAAAAGTAACCAACCTCATTGGA 
ACAAGTATTAACATTTTGGAATATGTTTTATTAGTTTTGTGATGTACTGTTTTACAATTTTT 
ACCATTTTTTTCAGTAATTACTGTAAAATGGTATTATTGGAATGAAACTATATTTCCTCATG 
TGCTGATTTGTCTTATTTTTTTCATACTTTCCCACTGGTGCTATTTTTATTTCCAATGGATA 
TTTCTGTATTACTAGGGAGGCATTTACAGTCCTCTAATGTTGATTAATATGTGAAAAGAAAT 
TGTACCAATTTTACTAAATTATGCAGTTTAAAATGGATGATTTTATGTTATGTGGATTTCAT 
TTCAATAAAAAAAAACTCTTATCAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA



FIGURE 202 



</usr/seqdb2/sst/DNA/Dnaseqs.min/ss.DNA53912
<subunit 1 of 1, 201 aa, 1 stop
<MW: 22563, pI: 4.87, NX(S/T): 1

MEYHPDLENLDEDGYTQLHFDSQSNTRIAVVSEKGSCAASPPWRLIAVILGILCLVILVIAV
VLGTMGVLSSPCPPNWIIYEKSCYLFSMSLNSWDGSKRQCWQLGSNLLKIDSSNELGFIVKQ
VSSQPDNSFWIGLSRPQTEVPWLWEDGSTFSSNLFQIRTTATQENPSPNCVWIHVSVIYDQL
CSVPSYSICEKKFSM

Important features: 

Type II transmembrane domain: 

amino acids 45-65 

cAMP- and cGMP-dependent protein kinase phosphorylation site.

amino acids 197-200 

N-myristoylation sites.
amino acids 35-40 and 151-156 

Homologous region to LDL receptor 

amino acids 34-67 and 70-200. 



FIGURE 203 

CaiCAGGCTGGAGTGCS«3TG6TGTGATCOTGGCrcaTCGTi^C^ 
TCaGCCTCCCGAGTAGCTGGGATTACAGGTGGTGaVCTTCCAAGAGTGACT^ 

GAAGACTTTCGCrTCTGaiGCCA6a3GaACCaGACaCAC^GGAGakGCCrCCa<^^ 

CGCATCTCCATCGAGAACTCCGAAGAGGCCCTCACAGTCCATGCCCCTTTCCCTGCSkGCCCaCCCT 

TCCTTCCCTGACCCCAGGGGCCTCTACCACTTCTGCCTCTACTGGAACCGACATGCTGGGAGATTACATCTTCTC 

TATGGCAAGCGTGACTTCTTGCTGAGTGACAAAGCCTCTAGCCTCCTCTGCTTCCAGCACCAGGAGGAGAGCCTG 

GCTCAGGGCCCCCCGCTGTTAGCCACTTCTGTCACCTCCTGGTGGAGCCCTCAGAACaVTCAGCCTGCCCaGTC 

QCCaGCTTCACCOTCTCCTTCCACAGTCCTCCCCACaVCGGCCGCTCACAATGCCTCGGTGGACA 

AAAAGGGACCTCCaGCTGCTCAGCCAGTTCCTGAAGa^TCCCCAGAAGGCCTa^^GGAGGCCCTCGGCT 

GCCAGCCaGaMSTTGCAGftGCCTGQAGTCQAaAOTGACCTCTGTGAGATTCaTGGGGGAC^ 

GAGGACCGGATCS^CGCCaaKTGTGGAAGCTCCaGCCCAa^GCCGGCCTCCAGGACCTGC^ 

C3VGGRGGAGGAGCAQAGCGAGATCATGGAGTACrCG<SnXTGCrGCCT 

CGGAGCGGGQAGGCTmGAAQAGACTCCTCCTGGTGGACTTCAGCaLGCCaAGCCCTGTTCC^ 

AGCCaAGTCCTGGGTGAGAAGGTCTTGGGGATTGTGGTACAGAACS^CCAAAGTAGCCAACCTCAC^^ 

GTGCTCACTTTCCAGCACCAGCTACAGCCGAAGAATGTGACTCTGCAATGTGTGTTCTGGGTTGAAGACCCCACA 

TTGAGCAGCCCGGGGCATTGGAGCAGTGCTGGGTGTGAGACCGTCAGGAGAGAAACCa^AACaTCCTGCTTCTGC 

AACCACTTGACCTACTTTGCaGTGCTGATGGTCTCCTCGGTGGAGGTGGACGCCGTGCaCAAGCaiCTACCTGAGC 

CTCXITCTCCTACGTGGGCTGTGTCGTCTCTGCCCTGGCCTGCCTTGTCACCATTGCCGCCTACCTCTGCTCCAGG 

GTGCCCCTGCCGTGCAGGAGGAAACCTCGGGACTACACCATCAAGGTGCACATGAACCTGCTGCTGGCCGTCTTC 

CTGCTGGACACGAGCTTCCTGCTCAGCGAGCCGGTGGCCCTGACAGGCTCTGAGGCTGGCTGCCXSAGCC^ 

ATCOT^CCTGCACTTCTCCCTGCTO^CCTGCCTTTCCrGGATGGGCCTaaAGGGGT^ 

GTGGAGGTCOTTGGOlCCTATGTCCCTGGCTACCTACTa^GCTGAGCXKXA^ 

CK3GTGACXK:TGGTGGCCCTGGTGGATGTGGACAACTATGGCCCCAT<3^TCT 

GGCGTCATOTACCCTTCCaTGTGCTGGATCCGGGACTCCCTGGTCAGCTACATCACCAACCTGGGCCTCTTCAG^ 

CTGGTGTTTCTGTTCAACATGGCCATGCTAGCa^CCATGGTGGTGCaGATCCTGCGGCTGCGCCCCCAC^^ 

AAGTGGTCACATGTGCTGACACTGCTGGGCCTCAGCCTGGTCCTTGGCCTGCCCTGGGCCTTGATCTTCTTCTCC 

TTTGCTTCTGGCACCTTCCmGCTTGTCGTCCTCTACCTTTTCAGCATCATCACCTCCTTCCaA^ 

TTCATCTGGTACTGGTCCATGCGGCTGCAGGCCCGGGGTGGCCCCTCCCCTCTQaAGAGCAACTCAQACAGCGCC 

AGGCTCCCCATCAGCTCGGGCAGCACCTCGTCCaGCCGCATCTMGCCTCCAGCC^ 

CaWSAQATGCGGCCTCGTCGCACS^CTGCCTGTGGCCCCCGAGCCAGGCCCAGCCCCAGGCCaVGTCA 

GCCrTGGGQACTACTa3GCTCTCACT<axXrrCCCaCGGGACTC^ 

TCCCCACATCTGTCCa^CCaVGCTGGAGGCCTGGTCTCTCCTTACAACCCCT^ 

GGCCAGGCCTTGGATCTTGAGGGTCTGGCACaTCCTTAATCCTGTGCCCCTGCCTGGQACA 

CGAATGGGGCCCAGGGCAGACCTTCAGGGCa^.GAGCCCTGGCGGAGGAGAGGCCCTTTGCCAGGAGCACAGa^^ 
AGCTCGCCTACCTCTQAGCCCAGGCCCCCTCCCTCCCTCAGCCCCCCAGTCCTCCCTCCATCTTCCCTGGGGTTC 
TCCTCCTCTCCCAGGGCCTCCTTGCTCCTTCGTTCACAGCTGGGGGTCCCCGATTCCAATGCTGTTTTTTGGGGA 
GTGGTTTCCAGGAGCTGCCTGGTGTCTGCTGTAAATGTTTGTCTACTGCACAAGCCTCGGCCTGCCCCTGAGCCA 
GGCTCGGTACCGATGCGTGGGCTGGGCTAC^TCCCTCTGTCCATCTGGGCCTTTGTATGAGCTGCATTGCCCTTG 
CTaVCCCI^CCAAGCACACGCCTCaGAGGGG<X:CTCAGCCTCTCCTr^^ 

GAGCCTGA<3^CTCTCCTAAGAGGTTOTOTCCAAGCCCCCAAATAGCTCCAGGCGCCCTa3^ 

TAATTCTGTCCa^CaAACACT^CACGGGTAGATTGCTGGCCTGTTGTAGGTGGTAGGGACAa^GAT^ 

GTCACTCCTCCTGCCaAa\TTCAGTCTGGTATGTGAGGCGTGCGTGAAGCaAGaACTCCTGGAGCTACAGGG^ 

GGGAGCCATCATTCCTGCCTGGQAATCCTGGAAGACTTCCTGCAGGAGTCAGCGTTCAATCTTGACX:TTGaaGAT 

GGGAAGGATGTTCTTTTTAa3TACaW^TTCTTTTGTCTTTTGATATTAA2iAAGAAGTACATGTTaVTTGT^^ 

ATTTGGAAACTGTAGaAGAGaATCAAGAAGAAAAATAAAAATCaGCTGTTO 

AAAAAAAAAAAJ^AAAAAAAAAAAAAAAaAAAAAAAAAAAAAaAAAAAAAAAAAAAAA^^ 

</usr/seqdb2/sst/DNA/Dnaseqs.min/ss.DNA50921
<subunit 1 of 1, 693 aa, 1 stop
<MW: 77738, pI: 8.87, NX(S/T): 7

MTPQSLLQTTLFLLSLLFLVQGAHGRGHREDFRFCSQRNQTHRSSLHYKPTPDLRISIENSE
EALTVHAPFPAAHPASRSFPDPRGLYHFCLYWNRHAGRLHLLYGKRDFLLSDKASSLLCFQH
QEESLAQGPPLLATSVTSWWSPQNISLPSAASFTFSFHSPPHTAAHNASVDMCELKRDLQLL
SQFLKHPQKASRRPSAAPASQQLQSLESKLTSVRFMGDMVSFEEDRINATVWKLQPTAGLQD
LHIHSRQEEEQSEIMEYSVLLPRTLFQRTKGRSGEAEKRLLLVDFSSQALFQDKNSSQVLGE
KVLGIVVQNTKVANLTEPVVLTFQHQLQPKNVTLQCVFWVEDPTLSSPGHWSSAGCETVRRE
TQTSCFCNHLTYFAVLMVSSVEVDAVHKHYLSLLSYVGCVVSALACLVTIAAYLCSRVPLPC
RRKPRDYTIKVHMNLLLAVFLLDTSFLLSEPVALTGSEAGCRASAIFLHFSLLTCLSWMGLE
GYNLYRLVVEVFGTYVPGYLLKLSAMGWGFPIFLVTLVALVDVDNYGPIILAVHRTPEGVIY
PSMCWIRDSLVSYITNLGLFSLVFLFNMAMLATMVVQILRLRPHTQKWSHVLTLLGLSLVLG
LPWALIFFSFASGTFQLVVLYLFSIITSFQGFLIFIWYWSMRLQARGGPSPLKSNSDSARLP
ISSGSTSSSRI

Important features:
Signal peptide: 

amino acids 1-25 

Putative transmembrane domains: 

amino acids 382-398, 402-420, 445-468, 473-491, 519-537, 568-590 
and 634-657 

Microbodies C-terminal targeting signal.

amino acids 691-693 

cAMP- and cGMP-dependent protein kinase phosphorylation sites.
amino acids 198-201 and 370-373 
N-glycosylation sites. 

amino acids 39-42, 148-151, 171-174, 234-237, 303-306, 324-327 
and 341-344 

G-protein coupled receptors family 2 proteins 

amino acids 475-504 

TGCCTGGCCTGCCTTGTCaACAATGCCGCTTACTCTGCTTCCAGGTTGCCCTGCCTTGCAGA

GGAAANCNTCGGGACTACACOITCaAGTGCACATGAACCTGCTGCTGGCCGTCT 

GACACGAGCTTCCTGCTCAGCGNAGCCGGTGGCCCTGACAGGCTCTGAAGGCTGGCTGCCGA 

GCCAGTGCCATCTTCCTGCACTTCTCCTGCTCACCTGCCTTTCCTGGATGGGCCTCGAGGGG 

TACAACCTCTACCGACTCGTGGTGGAGGTCTTTGGCACCTATGTCCCTGGCTACCTACTCAA 

GCTGAGCGCCATGGGCTGGGGCTTCCCCATCTTTCTGGTGACGCTGGTGGCCCTGGTGGATG 

TGGACAACTATGGCCCCATCATCTTGGCTGTGCATAGGACTCCAGAGGGCGTCATCTACCCT 

TCCATGTGCTGGATCCGGGACTCCCTGGTCAGCTACATCACCAACCTGGGCCTCTTCAGCCT 

GGTGTTTCTGTTCAACATGG 



FIGURE 206 

CGGACGCGTGGGCGGACGOSTGGGCGGACGCGTGGGCGQACGCGTGGGCTGGTTCAGGTCC^ 

TCCTTTTCAAAAACTGGAGACACIAGAAGAGGGCTCTAGGaAaaAGTl^GGaTGGGa^ 

GCGATTCTCTGCTGCCaGAG(:a.GGCTCGGCGCTTCCACCCa^GT<KaWK:(:OTCC^^ 

TCGGGAGTCGCTGCTTCCAAAGTGCCCGCCGTGAGTGAGCTCTO^CCCCaGTCaGCa^ 

TTCTCCTGCTGACATCTGCCCTGGCCGGCCAGaGACAGGGGACTCaGGCGGAATCC3UVCOTQA.GTAGT 

AGTTTTCCAGCAACa^GGAACaGAACGQAGTACaAGATCCTCAGCaTGAGAGAATTATTACTGTGTCTA^ 

GAAGTATTCACAGCCCAAGGTTTCCTCATACTTATCCAAGAAATACGGTCTTGGTATGGAGATTAGTAGCAGTAG 

AGGAAAATGTATGQATACAACTTACGTTTOATGAAAGATTTGGGCTTGAAGACCCAGAAGATGACATATGCaAGT 

ATGATTTTGTAGAAGTTGAGGAACCCAGTGATGGAACTATATTAGGGCGCTGGTGTGGTTCTGGTACTGTACCAG 

GaAAACAGATTTCTAaAGGAAATa^TTAGGATAAGATTTGTATCTGATGAATATTTTCCTTCTGAACCA^ 

TCTGCATC<mCTACAACaTTGTCATGCCaa^TTCACaQZ^GCTGTGAGTCCTTCAGTGCTACCC^ 

TGCCACTGGACOTGCTTAATAATGCTATAACTGCCTTTAGTACCTTGGAAGACCTTATTCGATATCra^ 

AGAGATGGCaGTTGGACTTAGftAGATCTATATAGGCCSiACTTGGaUiCTTCTTGGCaAGGCTT^ 

GAAAATCCAGAGTGGTGGATCTGAACCTTCTAACAGAGGAGGTAAGATTATACAGCTGCACACCTCGTAACTT^ 

CAGTGTCCATAAGGGAAGZ^CTAAAGAGAACCGATACKS^TTrrCTGGCCAGGTTGTCTCCTG^ 

GTGGGAACTGTGCCTGTTGTCTCCACAATTGCAATGAATGTCAATGTGTCCCAAGCAAAGTTACTAAAAAATACC 

ACGAGGTCCTTCAGTTGAGACCAAAGACCGGTGTCAGGGGATTGCACAAATa^CTCACCGACGTGGCCCTG^ 

ACCATGAGGAGTGTGACTGTGTGTGCAGAGGGAGCACAGGAGGATAGCCGCATCACCACCAGCa^ 

GAGCTGTGCAGTGCAGTG6CTGATTCTATTAGAGAACGTATGCGTTATCTCCATCCTTAATCTCAGTTGTTTGCT 

TCaAGGACCrTTCaLTCTTCaGGaTTTACAGTGakTTCTG?\A?VGAGGAG^ 

ACAGCTCTTTTGaGAGGAGGCCTAAAGGACAGGAQZ^AAAGGTCTTCAATCGTGGA?^GA^ 

TAAATAGATOlCa^GCTAGTTTCaGAGTTACaVTGTACGTATTCCACTAGCTGGGTTCTGTATTTa^ 

GATACXSGOTTAGGGTAATGTCAGTACAGGAAAaZ^CTGTGa^GTGAGCaCCT 

TCTAAAGCTCCATGTCCTGGGCCTAAAATCGTATAAAATCTGGATTTTTTTTTTTTTTTTTGCT^^ 

ATGTAAACCa.GftACATTCTATGTACTACAAACCTGGTTTTTAAAAAGG2\ACTATGTTGCTAT^^ 

GTCATGCTGATAGGACAGACTGGATTTTTCATATTTCTTATTAAAATTTCTGCCATTTAGAAGAAGAGAACTACA 

TTCATGGTTTGGAAGAGATAAACCTGAAAAGAAGAGTGGCCTTATCTTCACTTTATCGATAAGTCAGTTTATTTG 

TTTCATTGTGTACATTTTTATATTCTCCTTTTGACATTATAACTGTTGGCTTTTCTAATCTTGTTAAATATATCT 

ATTTTTACCAAAGGTATTTAATATTCTTTTTTATGACAACTTAGATCAACTATTTTTAGCTTGGTAAATTTTTCT 

AAACACaATTGTTATAGCa^GAGGAACAAAGATGATATAAAATATTGTTGCTCTGACaAAAATACS^^ 

TTCTaSTATGGTGCTAGAGTTAGATTAATCTGCATTTTAAAAAACTGAATTGGAATAGAATTGGTAAGTTGCAAA 

GACTTTTTGAAAATAATTAAATTATCATATCTTCCATTCCTGTTATTGGAGATGAAAATAAAAAGCAACTTAT^ 

AAGTAGACATTCAGATCCAGCCATTACTAACCTATTCCTTTTTTGGGGAAATCTGAGCCTAGCTCA^ 

AAAGCACCTTGAAAAAGACTTGGCaGCTTCCTGATAaAGCGTGCTGTGCTGTCOVGTAGGa^ 

^^^^^^^^'^^'^'^'^'^^^^^^'^^^^TTCCATAGACTTGTATAAATA^ 

QAAGTATGTCTCTTAACCaGTTCACTTATTGTACrCTGGCaATTTAAa^ 

AAAATGCOTAATATNGTGCCTAGGTTATGTGGTGACTATTT^ 

ATGTGGCnSVTTTTGGGGAGAAAATTAAAAAAAflAZUiAAAJUiAAaAa^ 

MSLFGLLLLTSALAGQRQGTQAESNLSSKFQFSSNKEQNGVQDPQHERIITVSTNGSIHSPR
FPHTYPRNTVLVWRLVAVEENVWIQLTFDERFGLEDPEDDICKYDFVEVEEPSDGTILGRWC
GSGTVPGKQISKGNQIRIRFVSDEYFPSEPGFCIHYNIVMPQFTEAVSPSVLPPSALPLDLL
NNAITAFSTLEDLIRYLEPERWQLDLEDLYRPTWQLLGKAFVFGRKSRVVDLNLLTEEVRLY
SCTPRNFSVSIREELKRTDTIFWPGCLLVKRCGGNCACCLHNCNECQCVPSKVTKKYHEVLQ
LRPKTGVRGLHKSLTDVALEHHEECDCVCRGSTGG



Signal sequence: 
amino acids 1-14 



FIGURE 208 



CXXy^TCTCSUMSCTGATCOTGGCaCCTCTCATGCTCrGCT^ 

AGACTAAAAMTGGTGTTTCCaATGTGGACSlCTGAAGAGACS^ 

AAACTCOTTGGGGCTAGATGGTTTCCTA2^AACTCTGCCCTGTGAT6TCACTCT^^ 

ATCGTGGACTGCaCAGACaAGCS^TTTGAC^GAAATTCCTGGaGGTATTCCCAaaaAC^^ 

ACCATTAACCACATACCaGACaTCTCCCCAGCGTCCTTTCACyiGACTGGACCMCTGGTAGAGATC^ 

TGCAACTGTGTACCTATTCCACTGGGGTCAAAAAAC?VACATGTGCATCaAGAGGCTGCA 

TTTAGTGGACTCACTTATTTAAAATCCCTTTACCTGGATGGAAACCAGCTACTAGAGATACCGCAGGGCCTCCCG 
CCTAGCTTACAGCTTCTCAGCCTTGAGGCG^^O^CATCTTTTCCATaySAAZ^GAGA^ 

AAOVTAGAAATACTCTACCTGGGCCT^AAACTGTTATTATCGAAATCCTTGTTATGTTTCATATTCAATAGAQAAA 

GATGCCTTCCTAAACTTGAC3^AAGTTAAaAGTGCrCTCCCTGA2iAGATAACAATGTCAC^ 

TTGCakTCTACTTTAAOlGAACTATATCTCTACaACaACaTG^ 

TGTAAAAATAATTCTCCCCTACAGATCCCTGTAAATGCTTTTGATGCGCTGACAGaATTAAAAGTT^ 

CACAGTAACTCTCTTCAGCaiTGTGCCCCCaAGATGGTTTAaGaACATCaACa^ 

CAAAACTTCTTGGCCAAAGAaATTGGGGATGCTAAATTTCTGCATTTTCTCCCCAGCCXCa 

TCTTTCAATTTTGAACTTCAGGTCTATCGTGCATCTATGAATCTATaiCAAGCATTTTCTTC3^CTGAAAA<^ 

AAAATTCTGCGGATCAGAGGATATGTCTTTAAAGAGTTGAl^GCTTTAACCTCTCGCCaTTACATAATCITC^ 

AATCTTGAAGTTCTTGATCTTGGOVCTAACTTTATAAAAATTGCTAACCTCaVGCATGTTTAAACaATTT^^ 

CTGAAAGTCaTAGATCTTTCAGTGAATAAAATATCaCCTTa^GGAGATTCaAGTGAAGTTGGCTTCTGCTC^^ 

GCCAGaACTTCTGTAGaAAGTTATGAACCCa^GGTCCTGQAACaATTACATTATTTCAG^^ 

AOSAGTTGCaGtATTCaAAAACS^GAGGCTTCTTTCaTGTCTGTTAATGAAAGCTGCTACAAG 

TTGQATCTAAGTAAAAATAGTATATTTTTTGTCAACTCCrCTGATTTTCaiGCATCTTTCT^ 

AATCTGTCAGGAAATCTCATTAGCCaAACTCOTAATGGCaGTGAAT^^ 

GACTTCTCCAACaACCGGCTTGATTTACTCCATTCaACSVGCATTTGAaGAGCTT^ 

ATAAGCSkGTAATAGCCATTATTrrCAATCAGAAGGAATTACTCATATGCTAAACTTTAC 

CTGCaGAAACTGATGATGAACGACAATGACATCTCTTCCTCCACCaGCSlGGACCATGGAGAGTGAGTOTOT 

ACTCTGGAATTCAGAGGAAATOlCTTAGATGTTTTATGGAGAGAAGGTGATAACaGATACTTAaUVTTATTC^^ 

AATCTGCTAAAATTAGAGQAATTAGACATCTCTAAAAATTCCCTAAGTTTCTTGCCTTCTGGAGTTTTTGATGGT 

ATGCCTCCAAATCTAAAGAATCTCTCTTTGGCaVAAAATGGGCTCT^TCTTTCAGTTGGAAGAAACT 

CTAAAGaACCTGGAAACTTTGGACCTCAGCCaCS^CCSUiCTGACCACTGTCCCTGAGAGATT 

AGaAGCCTCaAGAATCTGATTCTTAAGAATAATCaAATCAGGAGTCTGACGAAGTATTTTCTAC^ 

CaGTTGCGATATCTGGaTCTCAGCTa«ATAAaATCCAGaTGATCCAAAAGACC^^ 




aWVAGTGTGATCTCCCTGGATCTGTACaCCTGTGAGTTAGATCTQACTAACCTGATTCTGTTCTa^CTTTC^^ 

TOTGTATCTCTCTTTCTCATGGTGATGATGACAGCaAGTCACCTCTATTTCTGGGATGTGTGGTATATTTAC^^ 

TTCTGTAAGGCCAAGATAAAGGGGTATCAGCGTCTAATATCACCAQACTGTTGCTATGATGCTTTTATTGTGTAT 

GAO^CTAAAGACCCAGCTGTGACCGAGTGGGTTTTGGCTGAGCTGGTGGCCAAACTGGAAGACCCAAGAGAGAAA 

CATTTTAATTTATGTCTCGAGGAAAGGGACTGGTTACCAGGGCAGCCAGTTCTGGAAAACCTTTCCCAGAGCATA 

CAGCTTAGO^AAAAGACAGTCTTTGTGATGACAGACAAGTATGCAAAGACTGAAaA^ 

TTGTCCCaTC^GAGGCTC&TGGATGaAAaAGTTGRTGTGATTATCTTGATATTTCTTGAGAAGCCCTTTCA^^ 
TCa^GTTCCTa3«3CTCCX3GAAAAGGCTCTGTGGG3^GTTCTGTCCTTGAGTGGCC^ 
CaiTACTTCTXSGCaGTGTCTAAAGAACXKCCTOGCCSiCaGA 
ACGGTCTaSCCCTTCTTTGCAAAACACS^CTGCCTAGTTTACCS^GGA^ 



nmr 

MVFPMWTLKRQILILFNIILISKLLGARWFPKTLPCDVTLDVPKNHVIVDCTDKHLTEIPGG

IPTNTTNLTLTINHIPDISPASFHRLDHLVEIDFROJCVPIPLGSKNm 

GLTYLKSLYLDGNQLLEIPQGLPPSLQLLSLEJ^IFSIRKENLTEIiANIEILYLGQNCYYR 

NPCWSYSIEKDAFLNLTKLKVLSLKDNNVTAVPTVLPSTLTELYLYNNMIAKIQEDDFN^ 

NQLQILDLSGNCPRCYNAPFPCTlPCKNNSPLQIPVISrAFDALTELKVLRLHSNSLQHVPPRWF 

KNINKLQELDLSQNFLAKEIGDAKFLHFLPSLIQLDLSFNFELQVYRASMNLSQAFSSLKSL 

KILRIRGYVFKELKSFNLSPLHNLQNLEVLDLGTNFIKIANLSMFKQFKRLKVIDLSVNKIS

PSGDSSEVGFCSNARTSVESYEPQVLEQLHYFRYDKYARSCRFKNKEASFMSVNESCYKYGQ 

TLDLSKNSIFFVKSSDFQHLSFLKCLNLSGNLISQTLNGSEFQPLAELRYLDFSNNRLDLLH 

STAFEELHKLEVLDISSNSHYFQSEGITHMLNFTKNLKVLQKLMMNDNDISSSTSRTMESES

LRTLEFRGNHLDVLWREGDNRYLQLFKNLLKLEELDISKNSLSFLPSGVFDGMPPNLKNLSL 

AKNGLKSFSWKKLQCLKNLETLDLSHNQLTTVPERLSNCSRSLKNLILKNNQIRSLTKYFLQ 

DAFQLRYLDLSSNKIQMIQKTSFPENVLmLKMLLLHHNRFLCTCDAWFVWVmraTEVT 

YLATDVTCVGPGAHKGQSVISLDLYTCELDLTNLILFSLSISVSLFIJMViyiMTASHLYFWDVW 

YIYHFCKAKIKGYQRLISPDCCYDAFIVYDTKDPAVTEWVLAELVAKLEDPREKHFNLCLEE 

RDWLPGQPVLENLSQSIQLSKKTVFVMTDKYAKTENFKIAFYLSHQRLMDEKVDVIILIFLE 

KPFQKSKFLQLRKRLCGSSVLEWPTNPQAHPYFWQCLKNAIATDiraVAYSQVFKETV 

Signal sequence: 

amino acids 1-26 

Transmembrane domain: 
amino acids 840-860 

GG6TACCATTCTGCGCTGCTGCAAGTTACGGAATGAAAAATTAG?^CaACAG^U^CATG^^ 

AGTCGTCaATGCTGACCTGOVTTTTCCTGCTAATATCTGGTTCCTGTGAGTTATGCGCCG^ 

GAAGCTATCCTTGTGATGAGAAAAAGCAAAATGACTaVGTTATTGCAGAGTGCAGa^ 

TTCCCCAAAOSGTGGGCaiAATATGTGAa^GAACiyiGACCTG^ 

CaTTTO^GGGCTGCAAAATCTCACTAAAATAAATCTAAACCAa^CCCCW^^ 

CCGGTATACAATa\AATGGCTTGAATATCACAGACGGGGCATTCCTCAACCTAA?iAAACCTAAGGGAGTTAC 

TTGAAGACRACCAGTTACCCCAAATACCCTCTGGTTTGCCAGAGTCTTTGACAGAACTTAGTCTAATTCa^ 

ATATATACAACATAACTAAAGAGGGC3^TTTCSy\.GACTTATAAACTTGAAAAATCTCTATTTGGCCTGGAACT 

ATTTTAACAAAGTTTGCGAGAAAACTAACATAGAAGATGGAGTATTTGAAACGCTQAO^AATTTGGAGTTGCTAT 

CaCTATCTTTOU^TTCTCTTTCACACGTGCCACCCAAACTGCCAAGCTCCCTACGCAAACTTTTTCTGAGC^ 

CCCAGATCSU^TACATTAGTGAAGAAGATTTCaAGGGATTGATAAATTTAACATTACTAGATTTAAGC 

GTCCX3AGGTGCTTCAATGCCCCATTTCCATGCGTGCCTTGTGATGGTGGTGCTTCAATTAATATAGATCGTTTTG 

CTTTTCAAAACTTGACCCa^CrTa3ATACCTAAACCTCTCT 

TTAAAAATATGCCTCaTCTGAAGGTGCTGGATCTTGAATTCAACTATTTAGTGGGAGA 

TTTTAACX3ATGCTGCCCCX3CTTAG3^TACTTGACTTGTCTTTTAACTATATAaAGGGGAGTTATC^ 

TTAATATTTCCSU3AAACTTCTCTAAACTTTTGTCTCTACGGGCATTGCaTTTAAGAGGTTATGT^ 

TCAGAGAAGATGATTTCCAGCCCCTGATGCAGCTTCCAAACTTATCGACTATCaACTTGGGTATTAATTTTATTA 

AGa^TCGATTTOWUkCTTTTCCAAAATTTCTCCAATCTGGAAATTATTTACTTGTCaiGAAl^ 

CGTTGGTAAAAGATACCCGGCaGAGTTATGCAAATAGTTCCTCTTTTCa^CGTCATATCCGGAAACGACGCTCAA 

CAGATTTTGAGTTTGACCCACATTCGAACTTTTATCATTTCACCCGTCCTTTAATAAAGCCACAATGTGCTGCTT 

ATGGAAAAGCCTTAGATTTAAGCCTCAACAGTATTTTCTTCa^TTGGGCCaAACCAATTTGAAAATCTTCCTGACA 

TTGCCTGTTTAAATCTGTCTGCAAATAGCAATGCTCAAGTGTTAAGTGGAACTGAATTTTCAGCCA 

TCAAATATTTGQATTTGAO^CAATAGACTAGACTTTGATAATGCTAGTGCTCTTACTGi^^ 

AAGTTCTAGATCTCAGCTATAATTCAaVCTATTTCAGAATAGCaWSGCGTAA 

ATTTCACAAATCTAAAAGTTTTAAACTTGAGCCACaACS^CMJTTATACTTTAACAG^ 

GCaAGTCCCTGGTAGAATTAGTTTTCa^GTGGCaATCGCCTTGACATTTTGTGGAATGATGATGAC^ 

TCTCCATTTTCaAAGGTCTCa^GAATCTGACy^CGTCTGGATTTATCCCTTAATAGGCTGAAGCACATCCC^^ 

AAGCATTCCTTAATTTGCCAGCGAGTCTCACTGAACTACATATAAATGATAATATGTTAAAGTTTTTTAACTGGA 

CATTACTCCAGCAGTTTCCTCGTCTCGAGTTGCTTGACTTACGTGGAAACAAACTACTCTTTTTAACTGATAGCC 

TATCTGACTTTACATCTTCCCTTCGGACACTGCTGCTGAGTCATAACAGGATTTCCCACCTACCCTCTGGCTTTC 

TTTCTGAAGTOVGTAGTCTGAAGCACCTCGATTTAAGTTCa^TCTGCTAAAAACAATC^ 

AAACTAAGACCACCaLCa^TTATCTATGTTGQAACTACACGGAAACCCCTTTGAATGCaiCCTGTGAa^ 

ATTTCCGAAQATGGATGGATGAACATCTGAATGTCaAAATTCCCy^GACTGGTAGATGTCaTTTGTGC 

TTTTCTTCACGTTCTTTATCaCCa^CCATGGTTATGTTGGCK3CCCnX3GCTCACCATTTG^ 

GGTTTATATATAATGTGTGTTTAGCTAAGGTAAAAGGCTACAGGTCTCTTTCCACATCCCAAACTTTCTATGATG 
CTTACATTTCTTATGACACCAAAGATGCCTCTGTTACTGACTGGGTGATAAATGAGCTGCGCTACCACCTTGAAG 
AGAGCCGAGACAAAZUVCGTTCTCCTTTGTCTAGAGGAGAGGGATTGGGACCCGGGATTGGCCATCATCGACAACC 
TC»LTGCAGAGCATCa\ACCa2\AGCAAGAAAACAGTATTTGTTTTAACCA?^i^^ 

AAACSiGCTTTTTACTTGGCTTTGCAGAGGCTAATGGATGAGAACaTGGATGTGATTATATTTATCCTGCTGGAGC 
O^GTGTTACAGCATTCTCAGTATTTGAGGCTACGGCAGCGGATCTGTAAGAGCTCCATCCTCCAGTGGCCTGACA 

ATATGTATGTCXSATTCa^TTAAGCAATACTraACTGaaSTTAftGTCATGAra^ 

GAATGAOVTTTCTGTATTAGTTATCTATTGCTATGTAACaAATTATCC^^ 

TTTGCTGGCCOlCaGTTTTTQAGGGTCAGGAGTCCaGGCCCAGCaTAACTGGGTCCrCl^ 

AGGCTGCAATGTAGGTGTTCACCAGAGACATAGGCATCAOTGGGGTCACAOTC^ 

ATTCCTCCTGGGCTATTGGCCAAAGGCTATACTCATGTAAGCCATGaSAGCCTCTCCCACaAGGC^ 

ATCAGAGCTAGCAAAAAAGAGAGGTTCCTAGCAAGATGAAGTCAa^TCTTTTGTAATCGAATCA^ 

lATCraVTCACTTTGGCCATATTCTATTTGTTAGAAGTAAACCAGAGGTCCCaCCA. 

TCAGTCCAGGGAAAACAGCTGAAGACCAAGATGGTGAGCTCTGATTGCTTCAGTTGGTCATCAACTATTTTCCCT 

TGACTGCTGTCCTGGGATGGCCTGCTATCTTGATGATAGATTGTGAATATCAGGAGGCAGGGATCACTGTGGACC 

ATCTTAGCAGTTGACCTAACACS^TCTTCTTTTa^TATCTAAGAACTTTTGCCACTGTGACTAATGGTCCT^ 

TTAAGCTGTTGTTTATATTTATCaTATATCTATGGCTACATGGTTATATTATGCTGTGGTTGCXSTTaM 

TTACaGTTGCTTTTAO^TATTTGCTGTAACATTTGACm 

ATAGCTTTTAAAGCATCTTTTACTTCOTACaiTTTTTT^ 

TTGTTAATTGaZATTGCTGTAAATCTTAAAATGAATQAATAAaAATGTTTC^ 

MENMFLQSSMLTCIFLLISGSCELCAEENPSRSYPCDEKKQNDSVIAECSNRRLQEVPQTVG
KYVTELDLSDNFITHITNESFQGLQNLTKINIJSJHNPWQHQNGNPGIQSNGIiNITIXSAFLNIj 
KNLRELLLEDNQLPQIPSGLPESLTELSLIQNNIYNITKEGISRLINLKNLYLAWNCYFNKV
CEKTNIEDGVFETLTNLELLSLSFNSLSHVPPKLPSSLRKLFLSNTQIKYISEEDFKGLINL
TLLDLSGNCPRCFNAPFPCVPCDGGASINIDRFAFQNLTQLRYLNLSSTSLRKINAAWFKNM
PHLKVLDLEFNYLVGEIVSGAFLTMLPRLEILDLSFNYIKGSYPQHINISRNFSKLLSLRAL
HLRGYVFQELREDDFQPLMQLPNLSTINLGINFIKQIDFKLFQNFSNLEIIYLSENRISPLV
KDTRQSYANSSSFQRHIRKRRSTDFEFDPHSNFYHFTRPLIKPQCAAYGKALDLSLNSIFFI
GPNQFENLPDIACLNLSANSNAQVLSGTEFSAIPHVKYLDLTNNRLDFDNASALTELSDLEV
LDLSYNSHYFRIAGVTHHLEFIQNFTNLKVLNLSHNNIYTLTDKYNLESKSLVELVFSGNRL 
DILPmDDDNRYISIFKGLKNLTRLDLSIJJRLKHIPNEAFLNLPASLTEIJIINDNMLKFFNWT 
LLQQFPRLELLDLRGNKLLFLTDSLSDFTSSLRTLLLSHNRISHLPSGFLSEVSSLKHLDLS 
SNLLKTINKSALETKTTTKLSMLELHGNPFECTCDIGDFRRWMDEHLNVKIPRLVDVICASP 
GDQRGKBIVSLELTTCVSDVTAVILFFFTFFITTMVMIJU^LAHHLFYraJVWFIYNVCLAKVK 
GYRSLSTSQTFYDAYISYDTKDASVTDWVINELRYHLEESRDKNVLLCLEERDWDPGLAIID 
NIMQSINQSKKTVFVLTKKYAKSWNFKTAFYIJ^QRLMDEIMVIIFILLEPVIKSHSQYLR^ 
RQRICKSSILQWPDNPKAEGLFWQTLR]WVLTEOTDSRY13NMYVDSIKQY 

Signal sequence: 
amino acids 1-26 

Transmembrane domain: 
amino acids 826-848 

CCAGGTCCAACTGCACCTCGGTTCTATCGATTGAATTCCCCGGGGATCCTCTAGAGATCCCT
CGACCTCGACCCACGCGTCCGCCAAGCTGGCCCTGCACGGCTGCAAGGGAGGCTCCTGTGGA 
GAGGCCAGGCAGGTGGGCCTCAGGAGGTGCCTCCAGGCGGCCAGTGGGCCTGAGGCCCCAGC
AAGGGCTAGGGTCCATCTCCAGTCCCAGGACACAGCAGCGGCCACCATGGCCACGCCTGGGC 
TCCAGCAGCATCAGCAGCCCCCAGGACCGGGGAGGCACAGGTGGCCCCCACCACCCGGAGGA 
GCAGCTCCTGCCCCTGTCCGGGGGATGACTGATTCTCCTCCGCCAGGCCACCCAGAGGAGAA 
GGCCACCCCGCCTGGAGGCACAGGCCATGAGGGGCTCTCAGGAGGTGCTGCTGATGTGGCTT 
CTGGTGTTGGCAGTGGGCGGCACAGAGCACGCCTACCGGCCCGGCCGTAGGGTGTGTGCTGT 
CCGGGCTCACGGGGACCCTGTCTCCGAGTCGTTCGTGCAGCGTGTGTACCAGCCCTTCCTCA 
CCACCTGCGACGGGCACCGGGCCTGCAGCACCTACCGAACCATCTATAGGACCGCCTACCGC 
CGCAGCCCTGGGCTGGCCCCTGCCAGGCCTCGCTACGCGTGCTGCCCCGGCTGGAAGAGGAC 
CAGCGGGCTTCCTGGGGCCTGTGGAGCAGCAATATGCCAGCCGCCATGCCGGAACGGAGGGA
GCTGTGTCCAGCCTGGCCGCTGCCGCTGCCCTGCAGGATGGCGGGGTGACACTTGCCAGTCA 
GATGTGGATGAATGCAGTGCTAGGAGGGGCGGCTGTCCCCAGCGCTGCATCAACACCGCCGG 
CAGTTACTGGTGCCAGTGTTGGGAGGGGCACAGCCTGTCTGCAGACGGTACACTCTGTGTGC 
CCAAGGGAGGGCCCCCCAGGGTGGCCCCCAACCCGACAGGAGTGGACAGTGCAATGAAGGAA 
GAAGTGCAGAGGCTGCAGTCCAGGGTGGACCTGCTGGAGGAGAAGCTGCAGCTGGTGCTGGC 
CCCACTGCACAGCCTGGCCTCGCAGGCACTGGAGCATGGGCTCCCGGACCCCGGCAGCCTCC 
TGGTGCACTCCTTCCAGCAGCTCGGCCGCATCGACTCCCTGAGCGAGCAGATTTCCTTCCTG 
GAGGAGCAGCTGGGGTCCTGCTCCTGCAAGAAAGACTCGTGACTGCCCAGCGCCCCAGGCTG 
GACTGAGCCCCTCACGCCGCCCTGCAGCCCCCATGCCCCTGCCCAACATGCTGGGGGTCCAG 
AAGCCACCTCGGGGTGACTGAGCGGAAGGCCAGGCAGGGCCTTCCTCCTCTTCCTCCTCCCC 
TTCCTCGGGAGGCTCCCCAGACCCTGGCATGGGATGGGCTGGGATCTTCTCTGTGAATCCAC 
CCCTGGCTACCCCCACCCTGGCTACCCCAACGGCATCCCAAGGCCAGGTGGGCCCTCAGCTG 
AGGGAAGGTACGAGCTCCCTGCTGGAGCCTGGGACCCATGGCACAGGCCAGGCAGCCCGGAG
GCTGGGTGGGGCCTCAGTGGGGGCTGCTGCCTGACCCCCAGCACAATAAAAATGAAACGTGA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAGGGCGGCCGCGACTCTAGAGT 
CGACCTGCAGAAGCTTGGCCGCCATGGCCCaACnTGTTTATTGCAGCTTAT^ 



FIGURE 213 



MRGSQEVLLMWLLVLAVGGTEHAYRPGRRVCAVRAHGDPVSESFVQRVYQPFLTTCDGHRAC
STYRTIYRTAYRRSPGLAPARPRYACCPGWKRTSGLPGACGAAICQPPCRNGGSCVQPGRCR
CPAGWRGDTCQSDVDECSARRGGCPQRCINTAGSYWCQCWEGHSLSADGTLCVPKGGPPRVA
PNPTGVDSAMKEEVQRLQSRVDLLEEKLQLVLAPLHSLASQALEHGLPDPGSLLVHSFQQLG 
RIDSLSEQISFLEEQLGSCSCKKDS 



Signal sequence: 

1-19 



FIGURE 214 



GCCAGGCAGGTGGGCCTCAGGAGGTGCCTCCAGGCGGCCAGTGGGCCTGAGGCCCCAGCAAG 
GGCTAGGGTCCATCTCCAGTCCCAGGACACAGCAGCGGCCACCATGGCCACGCCTGGGCTCC 
AGCAGCATCAGAGCAGCCCCTGTGGTTGGCAGCAAAGTTCAGCTTGGCTGGGCCCGCTGTGA 
GGGGCTTCGCGCTACGCCCTGCGGTGTCCCGAGGGCTGAGGTCTCCTCATCTTCTCCCTAGC 
AGTGGATGAGCAACCCAACGGGGGCCCGGGGAGGGGAACTGGCCCCGAGGGAGAGGAACCCC 
AAAGCCACATCTGTAGCCAGGATGAGCAGTGTGAATCCAGGCAGCCCCCAGGACCGGGGAGG 
CACAGGTGGCCCCCACCACCCGGAGGAGCAGCTCCTGCCCCTGTCCGGGGGATGACTGATTC
TCCTCCGCCAGGCCACCCAGAGGAGAAGGCCACCCCGCCTGGAGGCACAGGCCATGAGGGGC
TCTCAGGAGGTGCTGCTGATGTGGCTTCTGGTGTTGGCAGTGGGCGGCACAGAGCACGCCTA 
CCGGCCCGGCCGTAGGGTGTGTGCTGTCCGGGCTCACGGGGACCCTGTCTCCGAGTCGTTCG 
TGCAGCGTGTGTACCAGCCCTTCCTCACCACCTGCGACGGGCACCGGGCCTGCAGCACCTAC 
CGAACCATCTATAGGACCGCCTACCGCCGCAGCCCTGGGCTGGCCCCTGCCAGGCCTCGCTA 
CGCGTGCTGCCCCGGCTGGAAGAGGACCAGCGGGCTTCCTGGGGCCTGTGGAGCAGCAATAT 
GCCAGCCGCCATGCCGGAACGGAGGGAGCTGTGTCCAGCCTGGCCGCTGCCGCTGCCCTGCA 
GGATGGCGGGGTGACACTTGCCAGTCAGATGTGGATGAATGCAGTGCTAGGAGGGGCGGCTG 
TCCCCAGCGCTGCATCAACACCGCCGGCAGTTACTGGTGCCAGTGTTGGGAGGGGCACAGCC 
TGTCTGCAGACGGTACACTCTGTGTGCCCAAGGGAGGGCCCCCCAGGGTGGCCCCCAACCCG 
ACAGGAGTGGACAGTGCAATGAAGGAAGAAGTGCAGAGGCTGCAGTCCAGGGTGGACCTGCT 
GGAGGAGAAGCTGCAGCTGGTGCTGGCCCCACTGCACAGCCTGGCCTCGCAGGCACTGGAGC 
ATGGGCTCCCGGACCCCGGCAGCCTCCTGGTGCACTCCTTCCAGCAGCTCGGCCGCATCGAC 
TCCCTGAGCGAGCAGATTTCCTTCCTGGAGGAGCAGCTGGGGTCCTGCTCCTGCAAGAAAGA 
CTCGTGACTGCCCAGCGCTCCAGGCTGGACTGAGCCCCTCACGCCGCCCTGCAGCCCCCATG 
CCCCTGCCCAACATGCTGGGGGTCCAGAAGCCACCTCGGGGTGACTGAGCGGAAGGCCAGGC 
AGGGCCTTCCTCCTCTTCCTCCTCCCCTTCCTCGGGAGGCTCCCCAGACCCTGGCATGGGAT 
GGGCTGGGATCTTCTCTGTGAATCCACCCCTGGCTACCCCCACCCTGGCTACCCCAACGGCA 
TCCCAAGGCCAGGTGGACCCTCAGCTGAGGGAAGGTACGAGCTCCCTGCTGGAGCCTGGGAC 
CCATGGCACAGGCCAGGCAGCCCGGAGGCTGGGTGGGGCCTCAGTGGGGGCTGCTGCCTGAC 
CCCCAGCACAATAAAAATGAAACGTG



FIGURE 215 



MRGSQEVLLMWLLVLAVGGTEHAYRPGRRVCAVRAHGDPVSESFVQRVYQPFLTTCDGHRAC
STYRTIYRTAYRRSPGLAPARPRYACCPGWKRTSGLPGACGAAICQPPCRNGGSCVQPGRCR
CPAGWRGDTCQSDVDECSARRGGCPQRCINTAGSYWCQCWEGHSLSADGTLCVPKGGPPRVA
PNPTGVDSAMKEEVQRLQSRVDLLEEKLQLVLAPLHSLASQALEHGLPDPGSLLVHSFQQLG
RIDSLSEQISFLEEQLGSCSCKKDS 



Signal sequence: 
1-19 



FIGURE 216 



CCCACGCGTCCGAAGCTGGCCCTGCACGGCTGCT^GGGAGGCTCCTGTGGACAGGCCAGGCA 
GGTGGGCCTCAGGAGGTGCCTCCAGGCGGCCAGTGGGCCTGAGGCCCCAGCAAGGGCTAGGG 
TCCATCTCCAGTCCCAGGACACAGCAGCGGCCACCATGGCCACGCCTGGGCTCCAGCAGCAT 
CAGCAGCCCCCAGGACCGGGGAGGCACAGGTGGCCCCCACCACCCGGAGGAGCAGCTCCTGC 
CCCTGTCCGGGGGATGACTGATTCTCCTCCGCCAGGCCACCCAGAGGAGAAGGCCACCCCGC 
CTGGAGGCACAGGCCATGAGGGGCTCTCAGGAGGTGCTGCTGATGTGGCTTCTGGTGTTGGC 
AGTGGGCGGCACAGAGCACGCCTACCGGCCCGGCCGTAGGGTGTGTGCTGTCCGGGCTCACG 
GGGACCCTGTCTCCGAGTCGTTCGTGCAGCGTGTGTACCAGCCCTTCCTCACCACCTGCGAC 
GGGCACCGGGCCTGCAGCACCTACCGAACCATCTATAGGACCGCCTACCGCCGCAGCCCTGG 
GCTGGCCCCTGCCAGGCCTCGCTACGCGTGCTGCCCCGGCTGGAAGAGGACCAGCGGGCTTC 
CTGGGGCCTGTGGAGCAGCAATATGCCAGCCGCCATGCCGGAACGGAGGGAGCTGTGTCCAG 
CCTGGCCGCTGCCGCTGCCCTGCAGGATGGCGGGGTGACACTTGCCAGTCAGATGTGGATGA
ATGCAGTGCTAGGAGGGGCGGCTGTCCCCAGCGCTGCGTCAACACCGCCGGCAGTTACTGGT 
GCCAGTGTTGGGAGGGGCACAGCCTGTCTGCAGACGGTACACTCTGTGTGCCCAAGGGAGGG 
CCCCCCAGGGTGGCCCCCAACCCGACAGGAGTGGACAGTGCAATGAAGGAAGAAGTGCAGAG 
GCTGCAGTCCAGGGTGGACCTGCTGGAGGAGAAGCTGCAGCTGGTGCTGGCCCCACTGCACA 
GCCTGGCCTCGCAGGCACTGGAGCATGGGCTCCCGGACCCCGGCAGCCTCCTGGTGCACTCC 
TTCCAGCAGCTCGGCCGCATCGACTCCCTGAGCGAGCAGATTTCCTTCCTGGAGGAGCAGCT 
GGGGTCCTGCTCCTGCAAGAAAGACTCGTGACTGCCCAGCGCCCCAGGCTGGACTGAGCCCC 
TCACGCCGCCCTGCAGCCCCCATGCCCCTGCCCAACATGCTGGGGGTCCAGAAGCCACCTCG 
GGGTGACTGAGCGGAAGGCCAGGCAGGGCCTTCCTCCTCTTCCTCCTCCCCTTCCTCGGGAG 
GCTCCCCAGACCCTGGCATGGGATGGGCTGGGATCTTCTCTGTGAATCCACCCCTGGCTACC 
CCCACCCTGGCTACCCCAACGGCATCCCAAGGCCAGGTGGGCCCTCAGCTGAGGGAAGGTAC 
GAGCTCCCTGCTGGAGCCTGGGACCCATGGCACAGGCCAGGCAGCCCGGAGGCTGGGTGGGG 
CCTCAGTGGGGGCTGCTGCCTGACCCCCAGCACAATAAAAATGAAACGTG 



FIGURE 217 



MRGSQEVLLMWLLVLAVGGTEHAYRPGRRVCAVRAHGDPVSESFVQRVYQPFLTTCDGHRAC 
STYRTIYRTAYRRSPGLAPARPRYACCPGWKRTSGLPGACGAAICQPPCRNGGSCVQPGRCR
CPAGWRGDTCQSDVDECSARRGGCPQRCVNTAGSYWCQCWEGHSLSADGTLCVPKGGPPRVA
PNPTGVDSAMKEEVQRLQSRVDLLEEKLQLVLAPLHSLASQALEHGLPDPGSLLVHSFQQLG 
RIDSLSEQISFLEEQLGSCSCKKDS 



Signal sequence: 
1-19 



FIGURE 218 
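
The nucleotide and amino acid listings in the two preceding figures are related by standard-code translation from the annotated start codon. The following is a minimal Python sketch of that relationship, not the tool used to prepare the listings; the function name `translate_orf` and its `start` parameter are hypothetical, and the standard genetic code is assumed.

```python
# Standard genetic code, built from the conventional TCAG ordering.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate_orf(dna, start=None):
    """Translate dna from a 0-based start index up to the first stop codon.

    If start is None, the first ATG found is used (an assumption; the
    annotated start codon of a cDNA is not always the first ATG).
    """
    dna = dna.upper()
    if start is None:
        start = dna.find("ATG")
    if start < 0:
        return ""
    residues = []
    for i in range(start, len(dna) - 2, 3):
        # Codons containing ambiguous bases (e.g. N) are reported as X.
        residue = CODON_TABLE.get(dna[i:i + 3], "X")
        if residue == "*":
            break
        residues.append(residue)
    return "".join(residues)


if __name__ == "__main__":
    # Fragment spanning the start codon in the nucleotide listing above.
    print(translate_orf("GGCCATGAGGGGCTCTCAGGAGGTG"))
```

Applied to the fragment `GGCCATGAGGGGCTCTCAGGAGGTG` from the nucleotide listing, the sketch gives `MRGSQEV`, the first seven residues of the amino acid listing. For a full-length cDNA the `start` index should be supplied explicitly, because an upstream ATG may precede the annotated start codon.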



GGTTGCCACaGCTGGTTTAGGGa:CCGACCACrGGGGCCCCTTGTCaGGAGGAGA 
GACAAGTCGCTGCXACCTTTGGCTGCaSACGTGATTCCCTGGGa 

AGTTGGGTCTCCGTGTTTCa.GGa2GGCTCCX:CCTTCCTGGTCTCCCTTCrCCCGCTGGGC^ 
AGATTGTCTTCCAGGGCTAGCaATTGGACTTTTGATGATGTTTGaCCCAGCGGC^ 
TTCAAAGCTGGGCTCaGCCTCTGTTTCTTCTCTCGTGTAATCGCaAAACCCATTTTGQAGCA^^ 
TGTCTGTGATGGTGGTGAGAAAGAAGGTGACACGGAAATGGGaGAAACTCCCAGGCAGGAACSlCCTm 
ATGGCCGCGTCATGATGGCCCGGCaAAAGGGCATTTTCTACCTGaCCCrTTTCCTCaTCCTGGGGACS^TGTACa^ 
TCTTCTTCGCCTTTGAGTGCCGCTACCTGGCTGTTCAGCTGTCTCCTGCCATCCCTGTATTTGCraCCATGCTCT 
TCCTTTTCTCCATGGCTACACTGTTGAGGACCAGCTTCAGTGACCCTGGAGTGATTCCTCGGGCGCTACCAGATG 
AAGCAGCTTTCaTAGAAATGGAGATAGAAGCTACCAATGGTGCGGTGCCCCAGGGCCAGCQACCACCGCCTCGTA 
•TCAAGAATTTCCaGATAAACAACCAGATTGTGAAACTGAAATACTGTTACACATGC?^GATCTTCCGGCCTC 
GGGCCTCCCaTTGCAGCATCTGTQAa^CTGTGTGGAGaSCTTCGACCATCA^ 
TTGGAAAGAGGAACTACCGCTACTTCTACCTCTTCATCCTTTCTCTCTCCCrCCTCAC^ 
TCaACATCXSTCTATGTGGCCCTCS^TCTTTGaAAATTGGCTTCT 

TTCTAGAAGTCXrrCaTTTGCTTCTTTACACTCTGGTCCGTCGTGGGACTGACTGGATTTCATAClTTCC 

CTCTCAACCAGACAACC3^TGAAGACyVTCAAAGGATCATGGS.CAGGGAAGAATCGa3TCCAGAATCC^ 

ATGGCaATATTGTGAAGAACTGCTGTGAAGTGCTGTGTGGCCCCTTGCCCCCCAGTGTGCTGGATCGAAGGGGTA 

TTTTGCCACTGGAGGAAAGTGGAAGTCGACCTCCCAGTACTCAAGAGACCAGTAGCaGCCTCTTGCCACAGAGCC 

CyiGCCCCCACAGAACaCCTGAACTayATGAGATGCCGGAGGACAGCAGCACTCCCGAAGAGATGC^^ 

AGCCCCCAGAGCCACCACS^GGAGGCAGCTGAAGCTGAGAAGTAGCCTATCTATGGAAGAGACTTTTGTTTGTGTT 

TAATTAGGGCTATGAGAGATTTCAGGTGAGAAGTTAAACCTGAGACAGAGAGCAAGTAAGCTGTCCCTTTTAACT 

GTTTTTCTTTGGTCTTTAGTCACCa^GTTGCACaCTGGCaTTTTCTTGCTGC?\AGCTTTTT^ 

Ca^AGGCAGTGGCAGAAQATGTO^GTCACCTCrGATAACTGGAAAAATGGGTCTCTTGGGCCCTGG^^ 

CCATGGCCTCAGCCacaGGGTCCX:COTGGACCa:CTCTCTTCrCrCC^^ 

TGGTCTCATTCTGGGGCTAAAAGTTTTTGAGAOTGGCTCaUiATCCTCCCAAGCTGCTGC^ 

GGCAGTO^CAGAGACCTCTGGCCAGGGGATCCTAACTGGGTTCTTGGGGTCTTCAGGACTGAAQAGQAG^ 

TGGGGTO^GaAGATTCTCCTGGCCACCaAGTGCCAGCSVTTGCCCACaaATCCTTTTAGGAATGGGAa^ 

TCCa.CTTGTTGTANNWNNN]^^ 

CaVGGAATGGCAGTAATAAAAGTCTGCACTTTGGTCATTTCrTTTCCTCAGAGQAAGCCCGAGTGCTCaCTTAAAC 

ACTATCCCCTCAGACTCCCTGTGTGAGGCCTGCAGAGGCCCTGAATGCACAAATGGGAAACCAAGGCACAQAGAG 

GCTCTCCTCTCCTCTCCTCTCCCCCGATGTACCCTCAAAAAAAAAAAAATGCTAACCAGTTCTTCCATTAAGCCT 

CGGCTGAGTGAGGQAAAGCCCAGCACTGCTGCCCTCTCGGGTAACTCACCCTAAGGCCTCGGCCCACCTCTGGCT 

ATGGTAACCAO^CTGGGGGCTTCCn-CCaAGCCCCGCTCTTaza^ 

CACCCTGGGGGTGGGCTGTGGCCCCCAGTCaGCTCrraCTCAGGACCTCCTCT^ 

ATTATATGTGGCTATATTTCCTAGAGCACCTGTGTTTTCCTCTTTCTAAGCCAGGGTCCTGTCT

GCGGTGGGGGAGTGTAAACCGGAACTTTTCaTCTATTTGAAGGCGATTAAACTGTGTCTAATGCA 



FIGURE 219 



MSVPm7RKKVTRKWEKLPGRNTFCa)GRVMMARQKGIFYLTLFLIIiG 

QLSPAIPVFAAMLFLFSMATLLRTSFSDPGVIPRALPDEAAFIEMEIEATNGAVPQGQRPPP
RIKNFQINNQIVKLKYCYTCKIFRPPRASHCSICDNCVERFDHHCPWVGNCVGKRNYRYFYL
FILSLSLLTIYVFAFNIVYVALKSLKIGFLETLKETPGTVLEVLICFFTLWSVVGLTGFHTF
LVALNQTTNEDIKGSWTGKNRVQNPYSHGNIVKNCCEVLCGPLPPSVLDRRGILPLEESGSR
PPSTQETSSSLLPQSPAPTEHLNSNEMPEDSSTPEEMPPPEPPEPPQEAAEAEK 



Putative transmembrane domains: 

amino acids 36-55 (type II TM), 65-84, 188-208, 229-245



FIGURE 220 

AAAACCCTGTATTTTTTACaATGCTykATAGACJ^TNANCCTGGAGGTCTTTGAATTA^ 

TATAGGGATGGTGGGGTTGATTTTTNTTCCTGGAGGCTTTTGGCTTTGGACTCTCNCTTTCT 

CCCACAGAGCaJICTTCGACCATCACTGCCCCTGGGTGGGGAATTGTGTTGGAAAGAGGAACTA 

CCGCTANTTCTACCTCTTCATCCTTTNTCTCTCCCNCCTCACAATCTATGTCTTCGCCTTCA 

ACATCGT 



FIGURE 221 



GTTGTGTCCTTCAGCAAAACAGTGGATTTAAATCTCCTTGCACAAGCTTGAGAGCAACACAA 

TCTATCAGGAAAGAAAGAAAGAAAAAAACCGAACCTGACAAAAAAGAAGAAAAAGAAGAAGA 

AAAAAAATCATGAAAACCATCCAGCCAAAAATGCAC^TTCTATCTCTTGGGC^ 

GGGGCTGGCTGCTCTGTGTCTCTTCCAAGGAGTGCCCGTGCGCAGCGGAGATGCCACCTTCC 

CCAAAGCTATGGACAACGTGACGGTCCGGCAGGGGGAGAGCGCCACCCTCAGGTGCACTATT 

GACAACCGGGTCACCCGGGTGGCCTGGCTAAACCGCAGCACCATCCTCTATGCTGGGAATGA

CAAGTGGTGCCTGGATCCTCGCGTGGTCCTTCTGAGCAACACCCAAACGCAGTACAGCATCG 

AGATCCAGAACGTGGATGTGTATGACGAGGGCCCTTACACCTGCTCGGTGCAGACAGACAAC 

CACCCAAAGACCTCTAGGGTCCACCTCATTGTGCAAGTATCTCCCAAAATTGTAGAGATTTC 

TTCAGATATCTCCATTAATGAAGGGAACAATATTAGCCTCACCTGCATAGCAACTGGTAGAC 

CAGAGCCTACGGTTACTTGGAGACACATCTCTCCCAAAGCGGTTGGCTTTGTGAGTGAAGAC 

GAATACTTGGAAATTCAGGGCATCACCCGGGAGCAGTCAGGGGACTACGAGTGCAGTGCCTC 

CAATGACGTGGCCGCGCCCGTGGTACGGAGAGTAAAGGTCACCGTGAACTATCCACCATACA 

TTTCAGAAGCCAAGGGTACAGGTGTCCCCGTGGGACAAAAGGGGACACTGCAGTGTGAAGCC 

TCAGCAGTCCCCTCAGCAGAATTCCAGTGGTACAAGGATGACAAAAGACTGATTGAAGGAAA 

GA?^GGGGTGAAAGTGGAAAACAGACCTTTCCTCTCAAaA.CTCATCTTCTTCAATGTCTCTC 

AACATGACTATGGGAACTACACTTGCGTGGCCTCCAACAAGCTGGGCCACACCAATGCCAGC 

ATCATGCTATTTGGTCCAGGCGCCGTCAGCGAGGTGAGCAACGGCACGTCGAGGAGGGCAGG 

CTGCGTCTGGCTGCTGCCTCTTCTGGTCTTGCACCTGCTTCTCAAATTTTGATGTGAGTGCC 

ACTTCCCCACCCGGGAAAGGCTGCCGCCACCACCACCACCAACACAACAGCAATGGCAACAC 

CGACAGCAACCAATCAGATATATACAAATGAAATTAGAAGAAACACAGCCTCATGGGACAGA

AATTTGAGGGAGGGGAACAAAGAATACTTTGGGGGGAAAAGAGTTTTAAAAAAGAAATTGAA 

AATTGCCTTGCAGATATTTAGGTACAATGGAGTTTTCTTTTCCCAAACGGGAAGAACACAGC 

ACACCCGGCTTGGACCCACTGCAAGCTGCATCGTGCAACCTCTTTGGTGCCAGTGTGGGCAA 

GGGCTCAGCCTCTCTGCCCACAGAGTGCCCCCACGTGGAACATTCTGGAGCTGGCCATCCCA 

AATTCAATCAGTCCATAGAGACGAACAGAATGAGACCTTCCGGCCCAAGCGTGGCGCTGCGG 

GCACTTTGGTAGACTGTGCCACCACGGCGTGTGTTGTGAAACGTGAAATAAiyyVGAGCAT^ 

AAAAA 



FIGURE 222 



MKTIQPKMHNSISWAIFTGLAALCLFQGVPVRSGDATFPKAMDNVTVRQGESATLRCTIDNR

VTRVAWLNRSTILYAGNDKWCLDPRVVLLSNTQTQYSIEIQNVDVYDEGPYTCSVQTDNHPK

TSRVHLIVQVSPKIVEISSDISINEGNNISLTCIATGRPEPTVTWRHISPKAVGFVSEDEYL

EIQGITREQSGDYECSASNDVAAPVVRRVKVTVNYPPYISEAKGTGVPVGQKGTLQCEASAV

PSAEFQWYKDDKRLIEGKKGVKVENRPFLSKLIFFNVSEHDYGNYTCVASNKLGHTNASIML

FGPGAVSEVSNGTSRRAGCVWLLPLLVLHLLLKF



Signal peptide: 
amino acids 1-28 



FIGURE 223 



GAAAAAAAATCATGAAAACCATCCAGCCAAAAATGCACAATTCTATCTCT

ACGGGGCTGGCTGCTCTGTGTCTCTTCCAAGGAGTGCCCGTGCGCAGCGGAGATGCCACCTT

CCCCAAAGCTATGGACAACGTGACGGTCCGGCAGGGGGAGAGCGCCACCCTCAGGTGCACTA 

TTGACAACCGGGTCACCCGGGTGGCCTGGCTAAACCGCAGCACCATCCTCTATGCTGGGAAT

GACAAGTGGTGCCTGGATCCTCGCGTGGTCCTTCTGAGCAACACCCAAACGCAGTACAGCAT 

CGAGATCCAGAACGTGGATGTGTATGACGAGGGCCCTTACACCTGCTCGGTGCAGACAGACA 

ACCACCCAAAGACCTCTAGGGTCCACCTCATTGTGCAAGTATCTCCCAAAATTGTAGAGATT 

TCTTCAGATATCTCCATTAATGAAGGGAACAATATTAGCCTCACCTGCATAGCAACTGGTAG 

ACCAGAG 



FIGURE 224 



ATGGC1X3GTG3VCGGCGGGGCCGGGCAGGGGACCGGGGCCGCGGCCCGGGAGCGGGCCAGCTGCCGGGAGCCCTGA 

ATCACaSCCTGGCCCGACTCCaCCaTOAACGTCGCGCTGCAGGAGCTGGGAGCTGGCykGCaAC^ 

AAGGGGACaaGACAGCTGTTAGGCTCAaKa^CGCAGCTGGAGCTGGTCTTAGCAGGTGCCT 

GCAerGCTTCTGGGCTGCCTTGTGGCCCTAGGGGTCCAGTACXlAa^GAGAC^ 

GAGGCOTGCATTaSAGTGGCTGGAAAAATCCTGGaVGTCCCTGGACCGAGGGGTGAGCCCC^ 

CAGTTCTCCTCTGGGGGCTGGATTCGGAGGAACCCCCTGCCCGATGGGCGTTCTCGCTGGaACaCCTTa^ 

CTCTGGGaCOiAAACCAGGCCATACTGaAGCACCTGCTTGAAAACACCaCCTTa^CTCC^ 

CAGAAGACACAGCGCTTCTACCTATCTTGCCTACAGGTGGAGCGCATTGAGGAGCTGGGAGCCCaGCCACT^^ 

GACCTCATTGAGAAGa.TTGGTGGTTGGAACATTACGGGGCCCTGGGACCAGGACAACTTTATGGAGGTGTTG^ 

GCAGTAGCAGGGACCTACS^GGGCCACCCCATTCTTCaCCGTCTACATCAGTGCCGACTCTJ^ 

AATGTTATCCAGGTGGACCAGTCTGGGCTCTTTCTGCCCTCTCGGGATTACTACTTAAACAGAACTGCCAATGAG 

AAAGTGCTCACTGCCTATCTGGATTAO^TGGAGGAACTGGGGATGCrGCTGGGTGGGCGGCCCaCCTCC^ 

GAGCft.GATGa«XAGGTGCrGGAGTTGQAGATACAGCrGGCCa^ 

GAGGAGaAGATCTACCAa^GATGaGCATTTCGGAGCTGCAGGCrCI^^ 

CroTCTTTCTTGCTGTO^CCATTGGAGTTGAGTGACTCTGAGCCTOTGGTGGTGTATGG^ 

CAGGTGTCAGAGCTCATC?^CCGCACGGAACCAAGCaTCCTGAACAATTACCTGATCTGG^ 

ACAACCTCT^GCCTGGACCGACGCTTTGAGTCTGCaCaAGAGAAGCTGCTGGAGACCCT 

TCCTGTGTGCCGAGGTGGCAGACCTGCATCTCCaACa^CGGATGACGCCCTTGGCTTTGCTTTGGGGTCACTCTTC 

GTGAAGGCCACGTTTGACCGGCAAAGCAAAGAjy\TTGCAGAGGGGATGATCAGCGAAATCCGGACCGaVTTTG^ 

GAGGCCCTGGGACAGCTGGTTTGGATGGATGAGAAGACCCGCCAGGCAGCCAAGGAGaAAGCAGATGCCATCTA 

GATATGATTGGTTTCCCAGACTTTATCCTGGAGCCCAAAGAGCTGGATGATGTTTATGACGGGTACGAAATTTCT 

GAAGATTCTTTCTTCCAAAACATGTTGAATTTGTAa^CTTCTCTGCCaiAGGTTATGGCTGACCAGCTCCGC^ 

CCTCCG^GCXIGAGAaZAGTGGAGCaTGACCCCCa^GACAfiTGAATGCCT 

6TCTTCCCCGCTGGCATCCTGCAGGCCCCCTTCTATGCX:CGa^CaVCCCXakAGGCCCTGAACT 

GGTGTGGTCATGGGCCATGAGTTGACGCATGCCTTTGATGACCaAGGGCGCGaGTATGACSUU^^ 

CGGCCCTGGTGGO^GAATGAGTCCCTGGO^CCTTCCGGAACCACACXSGCCTGCM^GGAGGAAC^ 

TACCAGGTCaATGGGGAGAGGCTCAACGGCCGCCAGACGCTGGGGGAGAACATTACTGACAACGGGGGGCTGAAG 

GCTGCCTACaATGCTTACAAAGOlTGGCTGAGAA^GCATGGGGAGQAGCAGCAACTGCCSk 

CTGGTGACCGACCCCCACAGCCCTGCCCGCTTCCGCGTGCTGGGCACTCTCTCCAACTCCCGTGACTTCCTGCGG 
CACTTCGGCTGCCCTGTCGGCTCCCCCATGAACCCAGGGCAGCTGTGTGAGGTGTG GTAGA CCTGGATCAGGGGA 
GAAATGGCaiGCTGTCaCCAGACCTGGGGCAGCTCTCCTGa.CSU^GCTGTTTGCTCTTGGGTTGG^^ 
ATCCS^GCTGGGCTGGGTCTAGTCCCTCCCCCCCa^(3VGGTGaCATGAGTACa^ 



GCCTGGAAGAGGTCnXSGGTGGGGAGGCCAGTTCCCaTAGGAAGGAGTCTGCC 



FIGURE 225 



MNVALQELGAGSNVGFQKGTRQLLGSRTQLELVLAGASLLLAALLLGCLVALGVQYHRDPSH
STCLTEACIRVAGKILESLDRGVSPCEDPYQFSCGGWIRRNPLPDGRSRWNTFNSLWDQNQA
ILKHLLENTTFNSSSEAEQKTQRFYLSCLQVERIEELGAQPLRDLIEKIGGWNITGPWDQDN
FMEVLKAVAGTYRATPFFTVYISADSKSSNSNVIQVDQSGLFLPSRDYYLNRTANEKVLTAY
LDYMEELGMLLGGRPTSTREQMQQVLELEIQLANITVPQDQRRDEEKIYHKMSISELQALAP
SMDWLEFLSFLLSPLELSDSEPVVVYGMDYLQQVSELINRTEPSILNNYLIWNLVQKTTSSL
DRRFESAQEKLLETLYGTKKSCVPRWQTCISNTDDALGFALGSLFVKATFDRQSKEIAEGMI
SEIRTAFEEALGQLVWMDEKTRQAAKEKADAIYDMIGFPDFILEPKELDDVYDGYEISEDSF
FQNMLNLYNFSAKVMADQLRKPPSRDQWSMTPQTVNAYYLPTKNEIVFPAGILQAPFYARNH
PKALNFGGIGVVMGHELTHAFDDQGREYDKEGNLRPWWQNESLAAFRNHTACMEEQYNQYQV
NGERLNGRQTLGENITDNGGLKAAYNAYKAWLRKHGEEQQLPAVGLTNHQLFFVGPAQVWCS 
VRTPESSHEGLVTDPHSPARFRVLGTLSNSRDFLRHFGCPVGSPMNPGQLCEVW 



Type II Transmembrane domains 

amino acids 32-57 



FIGURE 226 
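
The type II transmembrane annotation above marks a long run of hydrophobic residues near the N-terminus. The listing does not say how the segment was predicted; as an illustration only, the following Python sketch scans a protein with a sliding-window Kyte-Doolittle hydropathy average. The function name `hydrophobic_windows`, the 19-residue window, and the 1.6 cutoff are assumptions chosen to follow the conventional Kyte-Doolittle settings, not parameters taken from this document.

```python
# Kyte-Doolittle hydropathy index for the 20 standard amino acids.
KD = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}


def hydrophobic_windows(protein, window=19, threshold=1.6):
    """Return (1-based start, mean hydropathy) for each window above threshold.

    Residues missing from the table (e.g. OCR debris or X) score 0.0.
    """
    hits = []
    for i in range(len(protein) - window + 1):
        segment = protein[i:i + window]
        mean = sum(KD.get(aa, 0.0) for aa in segment) / window
        if mean > threshold:
            hits.append((i + 1, round(mean, 2)))
    return hits


if __name__ == "__main__":
    # Candidate membrane-spanning stretch taken from the annotated region.
    print(hydrophobic_windows("LVLAGASLLLAALLLGCLV"))
```

Overlapping windows that pass the cutoff can be merged into a single candidate segment and compared with the annotated range (amino acids 32-57 here); whether the segment is type I or type II depends on its orientation and is not decided by hydropathy alone.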

GCCa3GCCCTCCX3CCCTa::GCaCTCX:CGCCTCCCTCCCTCCGCCCGCTCCCGCGCCCTCCTCC^ 

CS^GCrGTCCCGTraXXSTCATGCCGAGCCTCCCGGCCCCGrcCGGCCCCGCTGCT^ 

CGGCTCCCGGCCGGCCCGCX3GCGCa3GCCC3VGAGCCCCCCGTGCTGCCCaTCCGT^ 

CGTTCGGGGAGCGGOVGGTAGGTGGGOSCCCXSGGGGaGGaKJGGGaSGGGaGTCGGGCT 

CaGCCCGGAGGGGGCGCX3GGGCGCaGGTGGCTa3GCGCGGCGGGCGGCCa3GAGGGTG^ 

GCGGTGCCTGGGACCCGGGACCCGCGGGCAGCCCCaSGGGCGGO^CACGGCGCGAGCTGGGCAGCGGCCTC 

OiAGCCCGTCCCCGCAGGCTGCy^CCTTCGGCGGGAAGGTCTATGCCTTGGACGAQACGTGGCACCCGGACCTAGG 

GGAGCCATTCGGGGTGATGCGCTGCGTGCTGTGCGCCTGCGAGGCGCAGTGGGGTCGCCGTACCAGGGGCCCTGG 

CaGGGTCAGCTGC?iAGAACATCaAACCAGAGTGCCCAACCCCGGCCTGTGGGCAGCCGCGCCAGCT^ 

CTGCTGCCAGACCTGCCCCCAGGACTTCGTGGCGCTGCTQACAGGGCCGAGGTCGCAGGCGGTGGCACGAGCCCG 

2W3TCTCXXTOCTGCGCTCTAGCCTCCGCTTCTCTATCTCCrACa.GGCGGCTGGACCGCCCTACC^ 

GCGGGO^TGCCTOSGTTGTCTCTGCGGCTCCTTAGGGCaGAAaWSCTGCATGTGGC^ 
CCCTTCa«3GGGAGGTCTGGGGGCCTCTCaTCCGGCaCCGGGCCCTGTCCCCaGAG&CC^ 
TCTAGAAGGCCCCO^COVGCAGGGCGTAGGGGGaVTCACCCTGCTa^CTCraWSTGAC^ 
TTTTTTGOTGOTCTTCCGAGGCCTTGOlGGACrAACCCaLGGl^CCCTTGAGGCT 

GCTACTGCGAGAACTTCAGGCCAATGTCrCaGCCCAGGAACCAGGCTTTGCTGAGGTGCTGCCCAACCTGACA^ 

CCaVGGAGATGGACTGGCTGGTGCTGGGGGAGCTGCAGATGGCCCTGGAGTGGGCAGGCAGGCCAGGGCTGCGCA 

CAGTGGACACATTGCTGCCAGGAAGAGCTGCGACGTCCTGCAAAGTGTCCTTTGTGGGGCTAATGCCCTGATCCC 

AGTCCSUVACGGGTGCTGCCGGCTCaGCCAGCCTCaLCTCTGCTAGGAAATGGaTCCCTGATCCTCCA^ 

GGTAGGGACAACCAGTGAGGTGGTGGCCATGACACTGGAiyiCCAAGCCTCAGCGGAGGGATCAGCC^ 

GTGCCACaTGGCTGGCCTATCCTCCCCTGCCCCS^WSGCCGTGGGTATCTGCCCTGGGCTGGGGTGCCCG^^ 

TO^TATGCTGCTGCAGAATGAGCTCTTCCTGAACGTGGGCACCAAGGACITCCCAGACGGAGAGCT 

ACGTGGCTGCCCTGCCX:TACTGTGGGGCaTAGCXK:CCGCCCTGCCCGTGa::CC^^ 

CCCTGTGaAGAGCa^GCAGCAGGGaiCGCCTGGCTTTCCTTGGATAa:a^CnX3TCAC^ 

GCTGGCTGGGCTTGGTGGCTCAGAACAAGGCACTGTCACTGCCCACCTCCrTGGGCCTCCTGGAACGCCAGGGCC 

TCGGCGGCTGCTGAAGGGATTCTATGGCTCAGAGGCCCAGGGTGTGGTGAAGGACCTGGAGCCGGAACTGCTGCG 

GCACCTGGCAAAAGGCaTGGCTTCCCTGATGATCyiCCACCAAGGTAGCCCCAGAGGGGAGCTCCGAGGGCAGCCT 

CTCCTCCCAGGTGa^CaTAGCCAACG?^TGTGAGGTTGGCGGACTGCGCCTGGAGGCGGCCGGGGCCGAGGGG^ 

GCGGGCGCTGGGGGCTCCGGATACAGCCTCTGCTGCGCCGCCTGTGGTGCCTGGTCTCCCGGCCCTAGCGCCCGC 

CaaACCTGGTGGTCCTGGGCGGCCCCGAGACCCCAACACATGCTTCTTCGAGGGGCAGCAGCGCCCCCACGGGGC 

TCGCTGCGCGCCCAACTACGACCCGCTCTGCTG^CTCTGCACCTGCCSIGAGACGAACGGT^^ 

GGTGTGCCCACCGCCCAGCTGCCCaCACCCGGTGCAGGCTCKCGACCaGTGCTGCCCTGTOT 

TTTTGATGGTGACCGGAGCTGGOSGGCaGCXSGGTACGCXSGTGGC^^ 

GTGTGCTGTCTGCS^CCTGCAAGCAGGGGGGCaCTGGAGAGGTGCACTGTGAQAAGGTGCaiGTGTCCC^ 

CTGTGCCCAGCCTGTGCGTGTCAACCCCACCGACTGCTOCaAACAGTGTCCAGGTGAGGCCCACCCC^^ 

GGACCCCATGCAGGCTGATGGGCCCCGGGGCTGCCGTTTTGCTGGGCAGTGGTTCCCAGAGAGTCAGAGCTGGC^ 

CCCCTCAGTGCCCCCGTTTGGAQAGATGAGCTGTATCACCTGCAGATGTGGGGTAAGTGGGGAGCAGAGGCTTGT 

GTGAGGTGG6TACTGGGAGCCTGGTCTGGAGTAGGGAGACCTTCCCAGGGAGGTCCCTGAAGAAGCTGAAGGTCA 

GGGATGACTGTTCACTGCCa.CTGTCCTGTGGCrCGGGQAAGGAGAGTCGATGCTGTTCCCGCTGGACGGCCC^^ 
GGCGGCGTAAGTGAGGGaGTCCAGGGTCa^GCaLGCTGTGAGTGGAGGGCTCACCTGCCTGTGGGACTCCTGATCAG 
GGAAGGQAGCACTCACroTGTGCaGGAACAGTGCaGCCTGCCTC^ 
ACCTGGTGGAATTGTTATTTATGACCrTTTCTTTACAAATGAGAT^ 

ATQAAGGTCACCGAGCTGTGTGCACTGACCrGTTTAGAAAATACrGGCCTTTCTGGGACKS^ 

TGCCOTGCCCTCTATGCOTCTCTGTGCCTCTCCACTCCCTOTCCCCTCCTCCaACATTCCCTCCCTTCTGTCTCC 

AGCAGCCCCAGAGACCAGAACTGATCCAGAGCTGGAGAA^GAAGCCGAAGGCrCTTAGGGAGCAGCCAGAGG^ 

AAGTGACCAAGAGGATGGGGCCTGAGCTGGGGAAGGGGTGGCATCGAGQACCrTCTTGCATTCTCCTGTGGGAAG 

CCCAGTGCCTTTGCTCCTCTGTCCTGCCTCTACTCCCACCCCCACTACCTCTGGGAACCACAGCTCCACAAGGGG 

GAGAGGCAGCTGGGCa^GACCGAGGTCACSVGCCACTCCZ^GTCCTGCCCTGCCACCCTCGGCCTCTGTCCTGGI^ 

GCCCCaCCCCTTTCTTCCTGTACATAATGTCACTGGCTTGTTGGGATTTTTAATTTATCTTCACTCAGCACC^ 

GGCCCCGGACACTCCaCTCCTGCTOCCCCTGAGCTGAGCTkGAGTCATTATTGGAGAGTTTTGTATTTATTAAAAC 

ATTTCTTTTTOWSTCTTTGGGCATGAGGTTGGCTCTTTGTGGCCAGGAACCTGAGTGGGGCCT 

GGNGAGAGTAGGAGGTGAGAGAGAGGAGCTCTGACaCrTGGGGAGCTGaAAGAG^ 

CGTGGCmTTGGCTGGCS^TNCCTGGGTTCCGCSVGAGGGGCTGGGGATGGTTCOT^ 

AArrTAGGGAAGTAGAAGCAGGATTTTGACTCAaGTTTAGTTTCCaiCaTCGCTGGCCTC^ 

GGCCGAGCGGGGGTGCTGCGCGGCGGCCGTGATGGCTGGTGACGGCGGGGCCGGGCAGGGGA
CCGGGGCCGCGGCCCGGGAGCGGGCCAGCTGCCGGGAGCCCTGAATCACCGCCTGGCCCGAC
TCCACCATGAACGTCGCGCTGCAGGAGCTGGGAGCTGGCAGCAACGTGGGATTCCAGAAGGG
GACAAGACAGCTGTTAGGCTCACGCACGCAGCTGGAGCTGGTCTTAGCAGGTGCCTCTCTAC
TGCTGGCTGCACTGCTTCTGGGCTGCCTTGTGGCCCTAGGGGTCCAGTACCACAGAGACCCA 
TCCCACAGCACCTGCCTTACAGAGGCCTGCATTCGAGTGGCTGGAAAAATCCTGGAGTCCCT 
GGACCGAGGGGTGAGCCCCTGTGAGGACTTTTACCAGTTCTCCTGTGGGGGCTGGATTCGGA 
GGAACCCCCTGCCCGATGGGCGTTCTCGCTGGAACACCTTCAACAGCCTCTGGGACCAAAAC 
CAGGCCATACTGAAGCACCTGCTTGAAAACACCACCTTCAACTCCAGCAGTGAAGCTGAGCA 
GAAGACACAGCGCTTCTACCTATCTTGCCTACAGGTGGAGCGCATTGAGGAGCTGGGAGCCC 
AGCCACTGAGAGACCTCATTGAGAAGATTGGTGGTTGGAACATTACGGGGCCCTGGGACCAG 
GACAACTTTATGGAGGTGTTGAAGGCAGTAGCAGGGACCTACAGGGCCACCCCATTCTTCAC 
CGTCTACATCAGTGCCGACTCTAAGAGTTCCAACAGCAATGTTATCCAGGTGGACCAGTCTG 
GGCTCTTTCTGCCCTCTCGGGATTACTACTTAAACAGAACTGCCAATGAGAAAGTAAGGAAC 
ATCTTCCGAACCCCCATCCCTACCCCTGGCTGAGCTGGGCTGATCCCTGTTGACTTTTCCCT
TTGCCAAGGGTCAGAGCAGGGAAGGTGAGCCTATCCTGTCACCTAGTGAACAAACTGCCCCT 
CCTTTCTTTCTTCTTTTCTTCCTCCCTCCCTCCCTTTCTTCCCCTTTTCCTTCCTTCCTTCC 
TCTTATTCTTCTAGTAGGTTTCATAGACACCTACTGTGTGCCAGGTCCAGTGGGGGAATTCG 
GAGATATAAGTTTCCGAGCCATTGCCACAGGAAGCGTTCAGTGTCGATGGGTTCATGGACCT 
AGATAGGCTGATAACAAAGCTCACAAGAGGGTCCTGAGGATTCAGGAGAGACTTATGGAGCC 
AGCAAAGTCTTCCTGAAGAGATTGCATTTGAGCCAGGTCCTGTAG 

ATGCCTACTACCTTCCAACTAAGAATGAGATCGTCTTCCCCGCTGGCATCCTGCAGGCCCCC
TTCTATGCCCGCAACCACCCCAAGGCCCTGAACTTCGGTGGCATCGGTGTGGTCATGGGCCA 
TGAGTTGACGCATGCCTTTGATGACCAAGGGCGCGAGTATGACAAAGAAGGGAACCTGCGGC 
CCTGGTGGCAGAATGAGTCCCTGGCAGCCTTCCGGAACCACACGGCCTGCATGGAGGAACAG 
TACAATCAATACCAGGTCAATGGGGAGAGGCTCAACGGCCGCCAGACGCTGGGGGAGAACAT 
TGCTGACAACGGGGGGCTGAAGGCTGCCTACAATGCTTACAAAGCATGGCTGAGAAAGCATG 
GGGAGGAGCAGCAACTGCCAGCCGTGGGGCTCACCAACCACCAGCTCTTCTTCGTGGGATTT 
GCCCAGGTGTGGTGCTCGGTCCGCACACCAGAGAGCTCTCACGAGGGGCTGGTGACCGACCC 
CCACAGCCCTGCCCGCTTCCGCGTGCTGGGCACTCTCTCCAACTCCCGTGACTTCCTGCGGC 
ACTTCGGCTGCCCTGTCGGCTCCCCCATGAACCCAGGGCAGCTGTGTGAGGTGTGGTAGACC 
TGGATCAGGGGAGAAATGGCCAGCTGTCACCAGACCTGGGGCAGCTCTCCTGACAAAGCTGT 
TTGCTCTTGGGTTGGGAGGAAGCAAATGCAAGCTGGGCTGGGTCTAGTCCCTCCCCCCCACA 
GGTGACATGAGTACAGACCCTCCTCAATCACCACATTGTGCCTCTGCTTTGGGGGTGCCCCT 
GCCTCCAGCAGAGCCCCCACCATTCACTGTGACATCTTTCCGTGTCACCCTGCCTGGAAGAG 
GTCTGGGTGGGGAGGCCAGTTCCCATAGGAAGGAGTCTGCCTCTTCTGTCCCCAGGCTCACT
CAGCCTGGCGGCCATGGGGCCTGCCGTGCCTGCCCCACTGTGACCCACAGGCCTGGGTGGTG 
TACCTCCTGGACTTCTCCCCAGGCTCACTCAGTGCGCACTTAGGGGTGGACTCAGCTCTGTC 
TGGCTCACCCTCACGGGCTACCCCCACCTCACCCTGTGCTCCTTGTGCCACTGCTCCCAGTG 
CTGCTGCTGACCTTCACTGACAGCTCCTAGTGGAAGCCCAAGGGCCTCTGAAAGCCTCCTGC 
TGCCCACTGTTTCCCTGGGCTGAGAGGGGAAGTGCATATGTGTAGCGGGTACTGGTTCCTGT 
GTCTTAGGGCACAAGCCTTAGCAAATGATTGATTCTCCCTGGACAAAGCAGGAAAGCAGATA 
GAGCAGGGAAAAGGAAGAACAGAGTTTATTTTTACAGAAAAGAGGGTGGGAGGGTGTGGTCT 
TGGCCCTTATAGGACC 



FIGURE 229 

CCCACGCGTCCGAGCCGCCCGAGAATTAGACACACTCCGGACGCGGCCAAAAGCAACCGAGA 
GGAGGGGAGGCAAAAACACCGAAAAACAAAAAGAGAGAAACAACACCCT^CAACTGGGGTGG 
GGGGAAGAAAGAAAGAAAAGAAACCCACCCACCCACCAAAAAAAAAAAAAAAAAAAAAAAAA 
AAAAAAAAAAAATCCTGTGGCGCGCCGCCTGGTTCCCGGGAAGACTCGCCAGCACCAGGGGG 
TGGGGGAGTGCGAGCTGAAAGCTGCTGGAGAGTGAGCAGCCCTAGCAGGGATGGACATGATG 
CTGTTGGTGCAGGGTGCTTGTTGCTCGAACCAGTGGCTGGCGGCGGTGCTCCTCAGCCTGTG 
CTGCCTGCTACCCTCCTGCCTCCCGGCTGGACAGAGTGTGGACTTCCCCTGGGCGGCCGTGG 
ACAACATGATGGTCAGAAAAGGGGACACGGCGGTGCTTAGGTGTTATTTGGAAGATGGAGCT
TCAAAGGGTGCCTGGCTGAACCGGTCAAGTATTATTTTTGCGGGAGGTGATAAGTGGTCAGT 
GGATCCTCGAGTTTCAATTTCAACATTGAATAAAAGGGACTACAGCCTCCAGATACAGAATG 
TAGATGTGACAGATGATGGCCCATACACGTGTTCTGTTCAGACTCAACATACACCCAGAACA 
ATGCAGGTGCATCTAACTGTGCAAGTTCCTCCTAAGATATATGACATCTCAAATGATATGAC 
CGTCAATGAAGGAACCAACGTCACTCTTACTTGTTTGGCCACTGGGAAACCAGAGCCTTCCA 
TTTCTTGGCGACACATCTCCCCATCAGCAAAACCATTTGAAAATGGACAATATTTGGACATT 
TATGGAATTACAAGGGACCAGGCTGGGGAATATGAATGCAGTGCGGAAAATGCTGTGTCATT
CCCAGATGTGAGGAAAGTAAAAGTTGTTGTCAACTTTGCTCCTACTATTCAGGAAATTAAAT 
CTGGCACCGTGACCCCCGGACGCAGTGGCCTGATAAGATGTGAAGGTGCAGGTGTGCCGCCT 
CCAGCCTTTGAATGGTACAAAGGAGAGAAGAAGCTCTTCAATGGCCAACAAGGAATTATTAT 
TCAAAATTTTAGCACAAGATCCATTCTCACTGTTACCAACGTGACACAGGAGCACTTCGGCA 
ATTATACCTGTGTGGCTGCCAACAAGCTAGGCACAACCAATGCGAGCCTGCCTCTTAACCCT 
CCAAGTACAGCCCAGTATGGAATTACCGGGAGCGCTGATGTTCTTTTCTCCTGCTGGTACCT 
TGTGTTGACACTGTCCTCTTTCACCAGCATATTCTACCTGAAGAATGCCATTCTACAATAAA 
TTCAAAGACCCATAAAAGGCTTTTAAGGATTCTCTGAAAGTGCTGATGGCTGGATCCAATCT 
GGTACAGTTTGTTAAAAGCAGCGTGGGATATAATCAGCAGTGCTTACATGGGGATGATCGCC 
TTCTGTAGAATTGCTCATTATGTAAATACTTTAATTCTACTCTTTTTTGATTAGCTACATTA 
CCTTGTGAAGCAGTACACATTGTCCTTTTTTTAAGACGTGAAAGCTCTGAAATTACTTTTAG 
AGGATATTAATTGTGATTTCATGTTTGTAATCTACAACTTTTCAAAAGCATTCAGTCATGGT 
CTGCTAGGTTGCAGGCTGTAGTTTACAAAAACG2^TATTGCAGTGAATATGT6ATT.CTTTAA 
GGCTGCAATACAAGCATTCAGTTCCCTGTTTOiATAAGAGTCaATCCACATTTACAAAGATG 
CATTTTTTTCTTTTTTGATAAAAAAGCAAATAATATTGCCTTCAGATTATTTCTTCAAAATA 
TAACACATATCTAGATTTTTCTGCTTGCATGATATTCAGGTTTCAGGAATGAGCCTTGTAAT 
ATAACTGGCTGTGCAGCTCTGCTTCTCTTTCCTGTAAGTTCAGCATGGGTGTGCCTTCATAC 
AATAATATTTTTCTCTTTGTCTCCAACTAATATAAAATGTTTTGCTAAATCTTACAATTTGA 
AAGTAAAAATAAACCAGAGTGATCAAGTTAAACCATACACTATCTCTAAGTAACGAAGGAGC 
TATTGGACTGTAAAAATCTCTTCCTGCACTGACAATGGGGTTTGAGAATTTTGCCCCACACT 
AACTCAGTTCTTGTGATGAGAGACAATTTAATAACAGTATAGTAAATATACCATATGATTTC 
TTTAGTTGTAGCTAAATGTTAGATCCACCGTGGGAAATCATTCCCTTTAAAATGACAGCACA 
GTCCACTCAAAGGATTGCCTAGCAATACAGCATCTTTTCCTTTCACTAGTCCAAGCCAAAAA 
TTTTAAGATGATTTGTCAGAAAGGGCACAAAGTCCTATCACCTAATATTACAAGAGTTGGTA 
AGCGCTCATCATTAATTTTATTTTGTGGCAGGTATTATGACAGTCGACCTGGAGGGTATGGA 
TATGGATATGGACGTTCCAGAGACTATAATGGCAGAAACCAGGGTGGTTATGACCGCTACTC 
AGGAGGAAATTACAGAGACAATTATGACAACTGAAATGAGACATGCACATAATATAGATACA 
CAAGGAATAATTTCTGATCCAGGATCGTCCTTCCAAATGGCTGTATTTATAAAGGTTTTTGG 
AGCTGCACTGAAGCATCTTATTTTATAGTATATCAACCTTTTGTTTTTAAATTGACCTGCCA 
AGGTAGCTGAAGACCTTTTAGACAGTTCCATCTTTTTTTTTAAATTTTTTCTGCCTATTTAA 
AGACAAATTATGGGACGTTTGTCAAAAAAAAAAAAAAAAAAAAAAAAAAA



FIGURE 230 



MMLLVQGACCSNQWLAAVLLSLCCLLPSCLPAGQSVDFPWAAVDNMMVRKGDTAVLRCYLED
GASKGAWLNRSSIIFAGGDKWSVDPRVSISTLNKRDYSLQIQNVDVTDDGPYTCSVQTQHTP
RTMQVHLTVQVPPKIYDISNDMTVNEGTNVTLTCLATGKPEPSISWRHISPSAKPFENGQYL
DIYGITRDQAGEYECSAENAVSFPDVRKVKVVVNFAPTIQEIKSGTVTPGRSGLIRCEGAGV 
PPPAFEWYKGEKKLFNGQQGIIIQNFSTRSILTVTNVTQEHFGNYTCVAANKLGTTNASLPL 
NPPSTAQYGITGSADVLFSCWYLVLTLSSFTSIFYLKNAILQ 

Important features of the protein: 
Signal peptide: 
amino acids 1-31 

Transmembrane domain: 

amino acids 326-345 

N-glycosylation sites. 

amino acids 71-75, 153-157, 273-277, 284-288, 292-296, 305-309 

Casein kinase II phosphorylation site.
amino acids 147-151, 208-212, 224-228 

Tyrosine kinase phosphorylation site. 

amino acids 178-186 

N-myristoylation sites. 

amino acids 7-13, 63-70, 67-73, 151-157, 239-245, 291-297, 
302-308, 319-325 

Myelin P0 protein:

amino acids 92-121 



FIGURE 231 
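
The N-glycosylation entries in the feature list above correspond to Asn-X-Ser/Thr sequons. As a hedged illustration, the following Python sketch locates such sites using the PROSITE-style consensus N-{P}-[ST]-{P}; that the listing was generated with exactly this pattern is an assumption, and the function name `find_sequons` is hypothetical.

```python
import re

# N, then any residue except P, then S or T, then any residue except P.
# The lookahead lets overlapping sequons be reported separately.
SEQUON = re.compile(r"(?=N[^P][ST][^P])")


def find_sequons(protein):
    """Return 1-based (start, end) positions of each four-residue sequon."""
    return [(m.start() + 1, m.start() + 4) for m in SEQUON.finditer(protein)]


if __name__ == "__main__":
    # Toy sequence: NGTA and NASA match; NPTA is excluded by the proline.
    print(find_sequons("AANGTAANPTAANASA"))
```

The sketch reports each site as the four residues of the motif. The ranges printed in the listing (for example 71-75) span five positions, so they appear to run from the asparagine to one residue beyond the motif; that reading is an inference from the listing, not something it states.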



AGTGGTTCGATGGGAAGGATCTTTCTCCAAGTGGTTCCTCTTGAGGGGAGCATTTCTGCTGG 
CTCCAGGACTTTGGCCATCTATAAAGCTTGGCAATGAGAAATAAGAAAATTCTCAAGGAGGA
CGAGCTCTTGAGTGAGACCCAACAAGCTGCTTTTCACCAAATTGCAATGGAGCCTTTCGAAA 
TCAATGTTCCAAAGCCCAAGAGGAGAAATGGGGTGAACTTCTCCCTAGCTGTGGTGGTCATC 
TACCTGATCCTGCTCACCGCTGGCGCTGGGCTGCTGGTGGTCCAAGTTCTGAATCTGCAGGC 
GCGGCTCCGGGTCCTGGAGATGTATTTCCTCAATGACACTCTGGCGGCTGAGGACAGCCCGT 
CCTTCTCCTTGCTGCAGTCAGCACACCCTGGAGAACACCTGGCTCAGGGTGCATCGAGGCTG 
CAAGTCCTGCAGGCCCAACTCACCTGGGTCCGCGTCAGCCATGAGCACTTGCTGCAGCGGGT 
AGACAACTTCACTCAGAACCCAGGGATGTTCAGAATCAAAGGTGAACAAGGCGCCCCAGGTC 
TTCAAGGTCACAAGGGGGCCATGGGCATGCCTGGTGCCCCTGGCCCGCCGGGACCACCTGCT 
GAGAAGGGAGCCAAGGGGGCTATGGGACGAGATGGAGCAACAGGCCCCTCGGGACCCCAAGG 
CCCACCGGGAGTCAAGGGAGAGGCGGGCCTCCAAGGACCCCAGGGTGCTCCAGGGAAGCAAG 
GAGCCACTGGCACCCCAGGACCCCAAGGAGAGAAGGGCAGCAAAGGCGATGGGGGTCTCATT 
GGCCCAAAAGGGGAAACTGGAACTAAGGGAGAGAAAGGAGACCTGGGTCTCCCAGGAAGCAA
AGGGGACAGGGGCATGAAAGGAGATGCAGGGGTCATGGGGCCTCCTGGAGCCCAGGGGAGTA 
AAGGTGACTTCGGGAGGCCAGGCCCACCAGGTTTGGCTGGTTTTCCTGGAGCTAAAGGAGAT 
CAAGGACAACCTGGACTGCAGGGTGTTCCGGGCCCTCCTGGTGCAGTGGGACACCCAGGTGC 
CAAGGGTGAGCCTGGCAGTGCTGGCTCCCCTGGGCGAGCAGGACTTCCAGGGAGCCCCGGGA 
GTCCAGGAGCCACAGGCCTGAAAGGAAGCAAAGGGGACACAGGACTTCAAGGACAGCAAGGA 
AGAAAAGGAGAATCAGGAGTTCCAGGCCCTGCAGGTGTGAAGGGAGAACAGGGGAGCCCAGG 
GCTGGCAGGTCCCAAGGGAGCCCCTGGACAAGCTGGCCAGAAGGGAGACCAGGGAGTGAAAG 
GATCTTCTGGGGAGCAAGGAGTAAAGGGAGAAAAAGGTGAAAGAGGTGAAAACTCAGTGTCC 
GTCAGGATTGTCGGCAGTAGTAACCGAGGCCGGGCTGAAGTTTACTACAGTGGTACCTGGG6 
GACAATTTGCGATGACGAGTGGCAAAATTCTGATGCCATTGTCTTCTGCCGCATGCTGGGTT 
ACTCCAAAGGAAGGGCCCTGTACAAAGTGGGAGCTGGCACTGGGCAGATCTGGCTGGATAAT 
GTTCAGTGTCGGGGCaLCGGAGAGTACCCTGTGGAGCTGCACCAAGAATAGCTGGGGCC?lTC^ 
TGACTGCAGCCACGAGGAGGACGCAGGCGTGGAGTGCAGCGTCTGACCCGGAAACCCTTTCA 
CTTCTCTGCTCCCGAGGTGTCCTCGGGCTCATATGTGGGAAGGCAGAGGATCTCTGAGGAGT 
TCCCTGGGGACAACTGAGCAGCCTCTGGAGAGGGGCCATTAATAAAGCTCAACATCATTGA 



FIGURE 232 

></usr/seqdb2/sst/DNA/Dnaseqs . full/ss .DNA68886 
<subunit 1 of 1, 520 aa, 1 stop
<MW: 52658, pI: 9.16, NX(S/T): 3

MRNKKILKEDELLSETQQAAFHQIiy^EPFEIWPKPKRRNGVNFSLAVVVIYLILLTAGAGL 
LWQVLNLQARLRVLEMYFLNDTLAAEDSPSFSLLQSAHPGEHLAQGASRLQVLQAQLTWVR 
VSHEHLLQRVDNFTQNPGMFRIKGEQGAPGLQGHKGAMGMPGAPGPPGPPAEKGAKGAMGRD 
GATGPSGPQGPPGVKGEAGLQGPQGAPGKQGATGTPGPQGEKGSKGDGGLIGPKGET6TKGE 
KGDLGLPGSKGDRGMKGDAGVMGPPGAQGSKGDFGRPGPPGLAGFPGAKGDQGQPGLQGVPG 
PPGAVGHPGAKGEPGSAGSPGRAGLPGSPGSPGATGLKGSKGDTGLQGQQGRKGESGVPGPA 
GVKGEQGSPGLAGPKGAPGQAGQKGDQGVKGSSGEQGVKGEKGERGENSVSVRIVGSSNRGR 
AEVYYSGTWGTICDDEWQNSDAIVFCRMLGYSKGRALYKVGAGTGQIWLDNVQCRGTESTLW 
SCTKNSWGHHDCSHEEDAGVECSV 

Transmembrane domain: 
amino acids 47-66 (type II) 

N-glycosylation sites. 

amino acids 43-47, 83-87, 136-140 

Tyrosine kinase phosphorylation site. 

amino acids 432-440 

N-myristoylation sites. 

amino acids 41-47, 178-184, 253-259, 274-280, 340-346, 346-352, 
400-406, 441-447, 475-481, 490-496, 515-521 

Amidation site. 

amino acids 360-364 

Leucine zipper pattern, 
amino acids 56-78 

Speract receptor repeat 
amino acids 422-471, 488-519 

C1q domain proteins.

amino acids 151-184, 301-334, 316-349 
CCCACGCGTCCGAAGGCAGACAAAGGTTCATTTGTAAAGAAGCTCCTTCCAGCACCTCCTCT
CTTCTCCTTTTGCCCaAACTCACCCAGTGAGTGTGAGCATTTAAGAAGCATCCTCTGCCAAG 
ACCAAAAGGAAAGAAGAAAAAGGGCCAAAAGCO^A AATGA AACTGATGGTAgTTGTTTTr'Ar 
CATTGGGCTAACTTTGCTGCTAGGAGTTCAAGCC^TGCCTGCAAATCGCCTCTCTTGCTACA 
GAAAGATACTAAAAGATCACAACTGTCACAACCTTCCGGAAGGAGTAGCTGACCTGACACAG 
ATTGATGTCAATGTCCAGGATCATTTCTGGGATGGGAAGGGATGTGAGATGATCTGTTACTG 
CAACTTCAGCGAATTGCTCTGCTGCCCAAAAGACGTTTTCTTTGGACCAAAGATCTCTTTCG 
TGATTCCTTGCAACAATCAATGAGAATCTTCATGTATTCTGGAGAACACCATTCCTGATTTC 
CCACAAACTGCACTACATCAGTATAACTGCATTTCTAGTTTCTATATAGTGCAATAGAGCAT 
AGATTCTATAAATTCTTACTTGTCTAAGACAAGTAAATCTGTGTTAAACAAGTAGTAATAAA 
AGTTAATTCAATCTAAAAAAAAAAAAA 



FIGURE 234 



</usr/seqdb2/sst/DNA/Dnaseqs.min/ss.DNA52758
<subunit 1 of 1, 98 aa, 1 stop
<MW: 11081, pI: 6.68, NX(S/T): 1

MKLIWLVFTIGLTLLLGVQAMPANRLSCYRKILKDHNCHNLPEGVADLTQIDVNVQDHFWDG 
KGCEMICYCNFSELLCCPKDVFPGPKISFVIPCNNQ 

Important features:
Signal peptide:

amino acids 1-20

N-glycosylation site.

amino acids 72-76

Tyrosine kinase phosphorylation site.

amino acids 63-71
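
The NX(S/T) count in the header and the N-glycosylation site listed above can be cross-checked with a short motif scan. The following is a minimal sketch, not the annotation software used for the figure: it assumes the common N-X-(S/T) sequon definition with X being any residue except proline, and it uses the 98-residue sequence exactly as printed in this figure, so any residual transcription error in the sequence carries over. The names SEQ and find_nxst are illustrative only.

```python
import re

# Sequence as printed in the figure above (98 aa); transcription noise may remain.
SEQ = (
    "MKLIWLVFTIGLTLLLGVQAMPANRLSCYRKILKDHNCHNLPEGVADLTQIDVNVQDHFWDG"
    "KGCEMICYCNFSELLCCPKDVFPGPKISFVIPCNNQ"
)

def find_nxst(seq):
    """Return 1-based start positions of N-X-(S/T) sequons, assuming X is not P."""
    # The lookahead keeps matches from consuming residues, so overlapping sites are found.
    return [m.start() + 1 for m in re.finditer(r"N(?=[^P][ST])", seq)]

sites = find_nxst(SEQ)
print(len(SEQ), sites)
```

Under these assumptions the scan reports a single sequon starting at residue 72, which agrees with the header count and with the start of the listed site.
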
CCCACGCGTCCGCGGACGCGTGGGCTGGACCCCAGGTCTGGAGCGAATTCCAGCCTGCAGGG

CTGATAAGCGAGGCATTAGTGAGATTGAGAGAGACTTTACCCCGCCGTGGTGGTTGGAGGGC 

GCGCAGTAGAGCAGCAGCACAGGCGCGGGTCCCGGGAGGCCGGCTCTGCTCGCGCCGA GATG 

TGGAATCTCCTTCACGAAACCGACTCGGCTGTGGCCACCGCGCGCCGCCCGCGCTGGCTGTG 

CGCTGGGGCGCTGGTGCTGGCGGGTGGCTTCTTTCTCCTCGGCTTCCTCTTCGGGTGGTTTA 

TAAAATCCTCCAATGAAGCTACTAACATTACTCCAAAGCATAATATGAAAGCATTTTTGGAT 

GAATTGAAAGCTGAGAACATC^GAAGTTCTTACATAATTTTACACAGATACCACATTT^ 

AGGAACAGAACAAAACTTTCAGCTTGCAAAGCAAATTCAATCCCAGTGGAAAGAATTTGGCC 

TGQATTCTGTTGAGCTAGCTCATTATGATGTCCTGTTGTCCTACCCAAATAAGACTCATCCC 

AACTACATCTCAATAATTAATGAAGATGGAAATGAGATTTTCAACACATCATTATTTGAACC 

ACCTCCTCCAGGATATGAAAATGTTTCGGATATTGTACCACCTTTCAGTGCTTTCTCTCCTC 

AAGGAATGCCAGAGGGCGATCTAGTGTATGTTAACTATGCACGAACTGAAGACTTCTTTl^ 

TTGGAACGGGACATGAAAATCAATTGCTCTGGGAAAATTGTAATTGCCAGATATGGGAAAGT 

TTTCAGAGGAAATAAGGTTAAAAATGCCCAGCTGGCAGGGGCCAAAGGAGTCATTCTCTACT 

CCGACCCTGCTGACTACTTTGCTCCTGGGGTGAAGTCCTATCCAGACGGTTGGAATCTTCCT 

GGAGGTGGTGTCCAGCGTGGAAATATCCTAAATCTGAATGGTGCAGGAGACCCTCTCACACC 

AGGTTACCCAGCAAATGAATATGCTTATAGGCGTGGAATTGCAGAGGCTGTTGGTCTTCCAA 

GTATTCCTGTTCATCCAATTGGATACTATGATGCACAGAAGCTCCTAGAAAAAATGGGTGGC 

TCAGCACCACCAGATAGCAGCTGGAGAGGAAGTCTCAAAGTGCCCTACAATGTTGGACCTGG 

CTTTACTGGAAACTTTTCTACACAAAAAGTCAAGATGCACATCCACTCTACCAATGAAGTGA 

CGAGAATTTACAATGTGATAGGTACTCTCAGAGGAGCAGTGGAACCAGACAGATATGTCATT 

CTGGGAGGTCACCGGGACTCATGGGTGTTTGGTGGTATTGACCCTCAGAGTGGAGCAGCTGT 

TGTTCATGAAATTGTGAGGAGCTTTGGAACACTGAAAAAGGAAGGGTGGAGACCTAGAAGAA 

CAATTTTGTTTGCAAGCTGGGATGCAGAAGAATTTGGTCTTCTTGGTTCTACTGAGTGGGCA 

GAGGAGAATTCAAGACTCCTTCAAGAGCGTGGCGTGGCTTATATTAATGCTGACTCATCTAT 

AGAAGGAAACTACACTCTGAGAGTTGATTGTACACCGCTGATGTACAGCTTGGTACACAACC 

TAACAAAAGAGCTGAAAAGCCCTGATGAAGGCTTTGAAGGCAAATCTCTTTATGAAAGTTGG 

ACTAAAAAAAGTCCTTCCCCAGAGTTCAGTGGCATGCCCAGGATAAGCAAATTGGGATCTGG 

AAATGATTTTGAGGTGTTCTTCCAACGACTTGGAATTGCTTCAGGCAGAGCACGGTATACTA 

AAAATTGGGAAACAAACAAATTCAGCGGCTATCCACTGTATCACAGTGTCTATGAAACATAT 

GAGTTGGTGGAAAAGTTTTATGATCCAATGTTTAAATATCACCTCACTGTGGCCCAGGTTCG 

AGGAGGGATGGTGTTTGAGCTAGCCAATTCCATAGTGCTCCCTTTTGATTGTCGAGATTATG 

CTGTAGTTTTAAGAAAGTATGCTGACAAAATCTACAGTATTTCTATGAAACATCCACAGGAA 

ATGAAGACATACAGTGTATCATTTGATTCACTTTTTTCTGCAGTAAAGAATTTTACAGAAAT 

TGCTTCCAAGTTCAGTGAGAGACTCCAGGACTTTGACAAAAGCAACCCAATAGTATTAAGAA 

TGATGAATGATCAACTCATGTTTCTGGAAAGAGCATTTATTGATCCATTAGGGTTACCAGAC 

AGGCCTTTTTATAGGCATGTCATCTATGCTCCAAGCAGCCACAACAAGTATGCAGGGGAGTC 

ATTCCCAGGAATTTATGATGCTCTGTTTGATATTGAAAGCAAAGTGGACCCTTCCAAGGCCT 

GGGGAGAAGTGAAGAGACAGATTTATGTTGCAGCCTTCACAGTGCAGGCAGCTGCAGAGACT 

TTGAGTGAAGTAGCCTAAGAGGATTTTTTAGAGAATCCGTATTGAATTTGTGTGGTATGTCA 

CTCAGAAAGAATCGTAATGGGTATATTGATAAATTTTAAAATTGGTATATTTGAAATAAAGT 

TGAATATTATATATAA 
</usr/seqdb2/sst/DNA/Dnaseqs.full/ss.DNA52756
<subunit 1 of 1, 750 aa, 1 stop
<MW: 84305, pI: 6.93, NX(S/T): 10

MWNLLHETDSAVATARRPRWLCAGALVLAGGFFLLGFLFGWFIKSSNEATNITPKHNMK^ 

DELKAENIKKFLIINFTQIPHLAGTEQNFQLAKQIQSQWKEFGIjDSVELAHYDVLLSyPNKTH 

PNYISIINEDGNEIFNTSLFEPPPPGYENVSDIVPPFSAFSPQGMPEGDLVYVNYARTEDFF 

KLERDMKINCSGKIVIARYGKVFRGNKVKNAQLAGAKGVILYSDPADYFAPGVKSYPDGWNL 

PGGGVQRGNILNLNGAGDPLTPGYPANEYAYRRGIAEAVGLPSIPVHPIGYYDAQKLLEKMG 

GSAPPDSSWRGSLKVPYNVGPGFTGNFSTQKVKMHIHSTNEVTRIYNVIGTLRGAVEPDRYV 

ILGGHRDSWVFGGIDPQSGAAWHEIVRSFGTLKKEGWRPRRTILFASWDAEEFGLLGSTEW 

AEENSRLLQERGVAYINADSSIEGNYTLRYDCTPLMYSLVHNLTKELKSPDEGFEGKSLYES 

WTKKSPSPEFSGMPRISKLGSGNDFEVFFQRLGIASGRliRYTKNWETNKFSGYPLYHSVYET 

YELVEKFYDPMFKYHLTVAQVRGGMVFELANSIVLPFDCRDYAVVLRKYADKIYSISMKHPQ 

EMKTYSVSFDSLFSAVKNFTEIASKFSERLQDFDKSNPIVLRMMNDQLMFLERAFIDPLGLP 

DRPFYRHVIYAPSSHNKyAGESFPGIYD2U:.FDIESKVDPSKAWGEVKRQIYVAAPTVQAAAE 

TLSEVA 

Signal sequence:
amino acids 1-40 

N-glycosylation sites. 

amino acids 76-80, 121-125, 140-144, 153-157, 195-199, 336-340, 
459-463, 476-480, 638-642 

Tyrosine kinase phosphorylation sites.

amino acids 363-372, 605-613, 606-613, 617-626 

N-myristoylation sites. 

amino acids 85-91, 168-174, 252-258, 256-262, 282-288, 335-341, 
360-366, 427-433, 529-535, 707-713 



SuperFect (Qiagen) and pulse-labeled for 3 hours with [35S]methionine and [35S]cysteine. Both epitope-tagged
proteins co-migrate when 20 microliters of 15-fold concentrated serum-free conditioned medium were
electrophoresed on a polyacrylamide gel (Novex) in sodium dodecyl sulfate sample buffer (SDS-PAGE). The
VEGF-E-IgG expression plasmid was constructed by cloning the ORF in front of the human Fc (IgG) sequence.

The VEGF-E-IgG plasmid was co-transfected with Baculogold Baculovirus DNA (Pharmingen) using
Lipofectin (GibcoBRL) into 10* Sf9 cells grown in Hink's TNM-FH medium (JRH Biosciences) supplemented
with 10% fetal bovine serum. Cells were incubated for 5 days at 28°C. The supernatant was harvested and
subsequently used for the first viral amplification by infecting Sf9 cells at an approximate multiplicity of infection
(MOI) of 10. Cells were incubated for 3 days, then supernatant harvested, and expression of the recombinant
plasmid determined by binding of 1 ml of supernatant to 30 µl of Protein-A Sepharose CL-4B beads (Pharmacia)
followed by subsequent SDS-PAGE analysis. The first amplification supernatant was used to infect a 500 ml
spinner culture of Sf9 cells grown in ESF-921 medium (Expression Systems LLC) at an approximate MOI of
0.1. Cells were treated as above, except harvested supernatant was sterile filtered. Specific protein was purified
by binding to a Protein-A Sepharose 4 Fast Flow (Pharmacia) column.

EXAMPLE 86: Northern Blot Analyses for PRO200

Blots of human poly(A)+ RNA from multiple adult and fetal tissues and tumor cell lines were obtained
from Clontech (Palo Alto, CA). Hybridization was carried out using 32P-labeled probes containing the entire
coding region and washed in 0.1 x SSC, 0.1% SDS at 63°C.

VEGF-E mRNA was detectable in fetal lung, kidney, brain, liver and adult heart, placenta, liver,
skeletal muscle, kidney, and pancreas. VEGF-E mRNA was also found in A549 lung adenocarcinoma and HeLa
cervical adenocarcinoma cell lines.

EXAMPLE 87: In Situ Hybridization of Human Fetal Tissue Sections for PRO200

Formalin-fixed, paraffin-embedded human fetal brain, liver, lower limb, small intestine, thyroid, lymph
node, thymus, stomach, trachea, skin, spleen, spinal cord, adrenal, placenta, cord, and adult liver, pancreas,
lung, spleen, lymph node, adrenal, heart, aorta, and skin were sectioned, deparaffinized, deproteinated in
proteinase K (20 µg/ml) for 15 minutes at 37°C, and further processed for in situ hybridization as described by
Lu LH and Gillett NA (Cell Vision 1: 169-176, 1994). A [α-33P]UTP-labeled antisense riboprobe was generated
from a PCR product of 980 bp (primers GGCGGAATCCAACCTGAGTAG and
GCGGCTATCCTCCTGTGCTC, SEQ ID NOS: 493 and 494, respectively). The slides were dipped in Kodak
NTB2 nuclear track emulsion and exposed for 4 weeks.

VEGF-E mRNA expression included localization at the growth plate region and embracing fetal
myocytes.

EXAMPLE 88: Myocyte Hypertrophy Assay for PRO200

Myocytes from neonatal Harlan Sprague Dawley rat heart ventricle (23 days gestation) were plated in
duplicate at 75000 cells/ml in a 96-well plate. Cells were treated for 48h with 2000, 200, 20, or 2 ng/ml
VEGF-E-IgG. Myocytes were stained with crystal violet to visualize morphology and scored on a scale of 3 to
7, 3 being nonstimulated and 7 being full-blown hypertrophy.

2000 ng/ml and 200 ng/ml VEGF-E caused hypertrophy, scored as a 5.



EXAMPLE 89: Cell Proliferation Assay for PRO200 

Mouse embryonic fibroblast C3H10T1/2 cells (ATCC) were grown in 50:50 Ham's F-12: low glucose
DMEM medium containing 10% fetal calf serum (FCS). Cells were plated in duplicate in a 24-well plate at
1000, 2000, and 4000 cells/well. After 48 hours, cells were switched to medium containing 2% FCS and were
incubated for 72 hours with 200, 800, or 2000 ng/ml VEGF-E or no growth factor added.

An approximately 1.5-fold greater number of cells was measured in the presence of 200 ng/ml VEGF-E
than in its absence, at all three cell densities.






EXAMPLE 90: Endothelial Cell Survival Assay for PRO200

Human umbilical vein endothelial cells (HUVEC, Cell Systems) were maintained in Complete Media
(Cell Systems) and plated in triplicate in serum-free medium (Basic Media from Cell Systems containing 0.1%
BSA) at 20,000 cells/well of a 48-well plate. Cells were incubated for 5 days with 200 or 400 ng/ml
VEGF-E-IgG, 100 ng/ml VEGF, 20 ng/ml basic FGF, or no addition.

Survival was 2-3 times greater with VEGF-E as compared to lack of growth factor addition. VEGF
and basic FGF were included as positive controls.



EXAMPLE 91: Isolation of cDNA Clones Encoding Human PRO285

A proprietary expressed sequence tag (EST) DNA database (LIFESEQ™, Incyte Pharmaceuticals, Palo
Alto, CA) was searched and an EST (#2243209) was identified which showed homology to the Drosophila Toll
protein.

Based on the EST, a pair of PCR primers (forward and reverse):
TAAAGACCCAGCTGTGACCG (SEQ ID NO:499)

ATCCATGAGCCTCTGATGGG (SEQ ID NO:500), and
a probe:

ATTTATGTCTCGAGGAAAGGGACTGGTTACCAGGGCAGCCAGTTC (SEQ ID NO:501)
were synthesized.

mRNA for construction of the cDNA libraries was isolated from human placenta tissue. The cDNA
libraries used to isolate the cDNA clones were constructed by standard methods using commercially available
reagents such as those from Invitrogen, San Diego, CA (Fast Track 2). The cDNA was primed with oligo dT
containing a NotI site, linked with blunt to SalI hemikinased adaptors, cleaved with NotI, sized appropriately
by gel electrophoresis, and cloned in a defined orientation into the cloning vector pCR2.1 (Invitrogen, Inc.)
using reagents and protocols from Life Technologies, Gaithersburg, MD (Super Script Plasmid System). The
double stranded cDNA was sized to greater than 1000 bp and the cDNA was cloned into BamHI/NotI cleaved
vector. pCR2.1 is a commercially available plasmid, designed for easy cloning of PCR fragments, that carries
AmpR and KanR genes for selection, and LacZ gene for blue-white selection.
In order to screen several libraries for a source of a full-length clone, DNA from the libraries was
screened by PCR amplification with the PCR primer pair identified above. A positive library was then used to
isolate clones encoding the PRO285 gene using the probe oligonucleotide and one of the PCR primers.

A cDNA clone was sequenced in entirety. The entire nucleotide sequence of DNA40021-1154
(encoding PRO285) is shown in Figure 208 (SEQ ID NO:495). Clone DNA40021-1154 contains a single open
reading frame with an apparent translational initiation site at nucleotide positions 61-63 (Figure 208). The
predicted polypeptide precursor is 1049 amino acids long, including a putative signal peptide at amino acid
positions 1-29, a putative transmembrane domain between amino acid positions 837-860, and leucine zipper
patterns at amino acid positions 132-153 and 704-725, respectively. It is noted that the indicated boundaries are
approximate, and the actual limits of the indicated regions might differ by a few amino acids. Clone
DNA40021-1154 has been deposited with ATCC (designation: DNA40021-1154) and is assigned ATCC deposit no. 209389.

Based on a BLAST and FastA sequence alignment analysis (using the ALIGN computer program) of
the full-length sequence, PRO285 is a human analogue of the Drosophila Toll protein, and is homologous to the
following human Toll proteins: Toll1 (DNAX# HSU88540-1, which is identical with the random sequenced full-length
cDNA #HUMRSC786-1); Toll2 (DNAX# HSU88878-1); Toll3 (DNAX# HSU88879-1); and Toll4 (DNAX#
HSU88880-1).



EXAMPLE 92: Isolation of cDNA Clones Encoding Human PRO286

A proprietary expressed sequence tag (EST) DNA database (LIFESEQ™, Incyte Pharmaceuticals, Palo
Alto, CA) was searched and an EST (#694401) was identified which showed homology to the Drosophila Toll
protein.

Based on the EST, a pair of PCR primers (forward and reverse): 
GCCGAGACAAAAACGTTCTCC (SEQ ID NO:502) 

CATCCATGTTCTCATCCATTAGCC (SEQ ID NO: 503), and 
a probe: 

TCGACAACCTCATGCAGAGCATCAACCAAAGCAAGAAAACAGTATT (SEQ ID NO: 504) 
were synthesized.

mRNA for construction of the cDNA libraries was isolated from human placenta tissue. This RNA was
used to generate an oligo dT primed cDNA library in the vector pRK5D using reagents and protocols from Life
Technologies, Gaithersburg, MD (Super Script Plasmid System). pRK5D is a cloning vector that has an sp6
transcription initiation site followed by an SfiI restriction enzyme site preceding the XhoI/NotI cDNA cloning
sites. The cDNA was primed with oligo dT containing a NotI site, linked with blunt to SalI hemikinased
adaptors, cleaved with NotI, sized to greater than 1000 bp appropriately by gel electrophoresis, and cloned in
a defined orientation into XhoI/NotI-cleaved pRK5D.

In order to screen several libraries for a source of a full-length clone, DNA from the libraries was
screened by PCR amplification with the PCR primer pair identified above. A positive library was then used to
isolate clones encoding the PRO286 gene using the probe oligonucleotide identified above and one of the PCR
primers.

A cDNA clone was sequenced in entirety. The entire nucleotide sequence of DNA42663-1154
(encoding PRO286) is shown in Figure 210 (SEQ ID NO:497). Clone DNA42663-1154 contains a single open
reading frame with an apparent translational initiation site at nucleotide positions 57-59 (Figure 211). The
predicted polypeptide precursor is 1041 amino acids long, including a putative signal peptide at amino acid
positions 1-26, a potential transmembrane domain at amino acid positions 826-848, and leucine zipper patterns
at amino acids 130-151, 206-227, 662-684, 669-690 and 693-614, respectively. It is noted that the indicated
boundaries are approximate, and the actual limits of the indicated regions might differ by a few amino acids.
Clone DNA42663-1154 has been deposited with ATCC (designation: DNA42663-1154) and is assigned ATCC
deposit no. 209386.

Based on a BLAST and FastA sequence alignment analysis (using the ALIGN computer program) of
the full-length sequence of PRO286, it is a human analogue of the Drosophila Toll protein, and is homologous
to the following human Toll proteins: Toll1 (DNAX# HSU88540-1, which is identical with the random
sequenced full-length cDNA #HUMRSC786-1); Toll2 (DNAX# HSU88878-1); Toll3 (DNAX# HSU88879-1);
and Toll4 (DNAX# HSU88880-1).




EXAMPLE 93: NF-κB Assay for PRO285 and PRO286

As the Toll proteins signal through the NF-κB pathway, their biological activity can be tested in an NF-κB
assay. In this assay Jurkat cells are transiently transfected using Lipofectamine reagent (Gibco BRL)
according to the manufacturer's instructions. 1 µg pB2XLuc plasmid, containing NF-κB-driven luciferase gene,
is cotransfected with 1 µg pSRαN expression vector with or without the insert encoding PRO285 or PRO286.
For a positive control, cells are treated with PMA (phorbol myristyl acetate; 20 ng/ml) and PHA
(phytohaemagglutinin, 2 µg/ml) for three to four hours. Cells are lysed 2 or 3 days later for measurement of
luciferase activity using reagents from Promega.

EXAMPLE 94: Isolation of cDNA Clones Encoding Human PRO213-1, PRO1330 and PRO1449

A consensus DNA sequence was assembled relative to other EST sequences using phrap as described
in Example 1 above. This consensus sequence is herein designated DNA28735. Based on the DNA28735
consensus sequence, oligonucleotides were synthesized: 1) to identify by PCR a cDNA library that contained
the sequence of interest, and 2) for use as probes to isolate a clone of the full-length coding sequence for
PRO213-1, PRO1330 and/or PRO1449. A pair of PCR primers (forward and reverse) were synthesized:
forward PCR primer 5'-TGGAGCAGCAATATGCCAGCC-3' (SEQ ID NO:511)
reverse PCR primer 5'-TTTTCGAGTCCTGTCGGGTTGG-3' (SEQ ID NO:512)

Additionally, a synthetic oligonucleotide hybridization probe was constructed from the consensus DNA28735
sequence which had the following nucleotide sequence:
hybridization probe

5'-GGTGAGAGTTGGGAGTGAGATGTGGATGAATGGAGTGCTAGGAGGG-3' (SEQ ID NO:513)
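
A quick sanity check on the three oligonucleotides listed above is to tabulate their lengths and GC content. The sketch below is illustrative only and is not part of the described procedure; the helper names (OLIGOS, gc_fraction, summarize) are hypothetical, and the sequences are copied from SEQ ID NOS: 511-513 as printed. Elsewhere this document describes primers of roughly 20 to 30 nucleotides and probes of typically 40-55 bp, which gives a reference range for the lengths.

```python
# Oligonucleotides copied from the text above (SEQ ID NOS: 511-513).
OLIGOS = {
    "forward PCR primer": "TGGAGCAGCAATATGCCAGCC",
    "reverse PCR primer": "TTTTCGAGTCCTGTCGGGTTGG",
    "hybridization probe": "GGTGAGAGTTGGGAGTGAGATGTGGATGAATGGAGTGCTAGGAGGG",
}

def gc_fraction(seq):
    """Fraction of bases in seq that are G or C."""
    seq = seq.upper()
    return (seq.count("G") + seq.count("C")) / len(seq)

def summarize(oligos):
    """Map each oligo name to (length in nt, GC fraction rounded to 2 places)."""
    return {name: (len(seq), round(gc_fraction(seq), 2)) for name, seq in oligos.items()}

for name, (length, gc) in summarize(OLIGOS).items():
    print(f"{name}: {length} nt, GC fraction {gc}")
```

The primers come out at 21 and 22 nt and the probe at 46 nt, all inside the ranges quoted for primer and probe design.
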
In order to screen several libraries for a source of a full-length clone, DNA from the libraries was
screened by PCR amplification with the PCR primer pair identified above. A positive library was then used to
isolate clones encoding the PRO213-1, PRO1330 and/or PRO1449 gene using the probe oligonucleotide and one
of the PCR primers. RNA for construction of the cDNA libraries was isolated from human fetal lung tissue.
DNA sequencing of the clones isolated as described above gave the full-length DNA sequence encoding
PRO213-1, PRO1330 and/or PRO1449 [DNA30943-1-1163-1 (SEQ ID NO:505), DNA64907-1163-1 (SEQ ID
NO:507) and DNA64908-1163-1 (SEQ ID NO:509), respectively].

The entire nucleotide sequences corresponding to DNA30943-1-1163-1 (SEQ ID NO:505), DNA64907-1163-1
(SEQ ID NO:507) and DNA64908-1163-1 (SEQ ID NO:509) are shown in Figures 212, 214 and 216, respectively.
DNA30943-1-1163-1, DNA64907-1163-1 and DNA64908-1163-1 contain a single open reading frame with an apparent
translational initiation site at nucleotide positions 336-338, 488-490 and 326-328, respectively, and ending at the
stop codon at nucleotide positions 1221-1223, 1307-1309 and 1145-1147, respectively (Figures 212, 214 and 216). The
predicted polypeptide precursor is 295, 273 and 273 amino acids long, respectively (Figures 213, 215 and 217).
DNA30943-1-1163-1, DNA64907-1163-1 and DNA64908-1163-1 have been deposited with ATCC and are
assigned ATCC deposit no. 209791, 203242 and 203243, respectively.
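
The stated polypeptide lengths follow directly from the start and stop codon positions: the number of coding nucleotides between the first base of the initiation codon and the first base of the stop codon, divided by three. The sketch below works through that arithmetic for the three clones; the function name and the dictionary are illustrative and not taken from the source.

```python
def orf_length_aa(start_codon_first_nt, stop_codon_first_nt):
    """Number of codons from the start codon up to, but not including, the stop codon."""
    coding_nt = stop_codon_first_nt - start_codon_first_nt
    if coding_nt <= 0 or coding_nt % 3 != 0:
        raise ValueError("start and stop codons are not in the same reading frame")
    return coding_nt // 3

# (first nucleotide of the initiation codon, first nucleotide of the stop codon),
# as given in the paragraph above.
ORFS = {
    "PRO213-1": (336, 1221),
    "PRO1330": (488, 1307),
    "PRO1449": (326, 1145),
}

for name, (start, stop) in ORFS.items():
    print(name, orf_length_aa(start, stop))
```

This gives 295, 273 and 273 residues, matching the predicted precursor lengths quoted in the text.
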
Analysis of the amino acid sequence of the full-length PRO213-1 polypeptide suggests that a portion
of it possesses significant homology to the human growth arrest-specific gene 6 protein. More specifically, an
analysis of the Dayhoff database (version 35.45 SwissProt 35) evidenced significant homology between the
PRO213 amino acid sequence and the following Dayhoff sequences, HSMHC3W5A_6 and B48089.

Additional analysis of the amino acid sequence of the full-length PRO1330 and PRO1449 polypeptide
indicates significant identity with notch4. More specifically, an analysis of the Dayhoff database (version 35.130
SwissProt 35) evidenced significant identity between PRO1330 and the following Dayhoff sequences, D86566_1
and NEL_HUMAN.
EXAMPLE 95: Isolation of cDNA Clones Encoding Human PRO298

A cDNA isolated in the amylase screen described in Example 2 above is herein designated DNA26832
(Figure 220; SEQ ID NO:516). The sequence of DNA26832 was then used to search expressed sequence tag
(EST) databases. The EST databases included public EST databases (e.g., GenBank) and a proprietary EST
database (LIFESEQ™, Incyte Pharmaceuticals, Palo Alto, CA). The search was performed using the computer
program BLAST or BLAST2 (Altschul et al., Methods in Enzymology 266: 460-480 [1996]). Those comparisons
resulting in a BLAST score of 70 (or in some cases 90) or greater that did not encode proteins were clustered
and assembled into consensus DNA sequences with the program "phrap" (Phil Green, University of Washington,
Seattle, Washington; http://bozeman.mbt.washington.edu/phrap.docs/phrap.html).

A consensus DNA sequence was assembled relative to other EST sequences using phrap. A consensus
sequence was determined, which was then extended using repeated cycles of BLAST and phrap to extend the
consensus sequence as far as possible using the sources of EST sequences discussed above. The extended
assembly sequence was designated DNA35861. Based on the DNA35861 consensus sequence, oligonucleotides
were synthesized: 1) to identify by PCR a cDNA library that contained the sequence of interest, and 2) for use
as probes to isolate a clone of the full-length coding sequence of PRO298. Forward and reverse primers
generally range from 20 to 30 nucleotides and are often designed to give a PCR product of about 100-1000 bp
in length. The probe sequence is typically 40-55 bp in length. In some cases, additional oligonucleotides are
synthesized when the consensus sequence is greater than about 1-1.5 kbp. In order to screen several libraries
for a full-length clone, DNA from the libraries was screened by PCR amplification, as per Ausubel et al.,
with the PCR primer pair. A positive library was used to isolate clones
encoding the gene of interest using the probe oligonucleotide and one of the primer pairs.

PCR primers (forward and reverse) and a hybridization probe were synthesized:
forward PCR primer 1 GAAGGTGATTTGAAAGGTGGGGTG (SEQ ID NO:517)
forward PCR primer 2 GGCrGGTATGAAGAATTTCG (SEQ ID NO:518)
forward PCR primer 3 AGTGGAAGTGGACCTTGGG (SEQ ID NO:519)
reverse PCR primer 1 GTGACGTGAAATCrCTGATAGCGC (SEQ ID NO:520)

hybridization probe 1 GGCAAAAGCGATTTTGGGAGCAGGAATTCGAATCATGTCTXiTGATGGTGG
(SEQ ID NO:521)

In order to screen several libraries for a source of a full-length clone, DNA from the libraries was
screened by PCR amplification with the PCR primer pair identified above. A positive library was then used to
isolate clones encoding the PRO298 gene using the probe oligonucleotide and one of the PCR primers.

RNA for construction of the cDNA libraries was isolated from human fetal lung tissue (LIB25). The
cDNA libraries used to isolate the cDNA clones were constructed by standard methods using commercially
available reagents such as those from Invitrogen, San Diego, CA. The cDNA was primed with oligo dT
containing a NotI site, linked with blunt to SalI hemikinased adaptors, cleaved with NotI, sized appropriately
by gel electrophoresis, and cloned in a defined orientation into a suitable cloning vector (such as pRKB or
pRKD; pRK5B is a precursor of pRK5D that does not contain the SfiI site; see Holmes et al., Science
253: 1278-1280 (1991)) in the unique XhoI and NotI sites.

DNA sequencing of the clones isolated as described above gave the full-length DNA sequence for
PRO298 (herein designated UNQ261 [DNA39975-1210]) (SEQ ID NO:514), and the derived protein sequence
for PRO298 (SEQ ID NO:515).

The entire nucleotide sequence of UNQ261 (DNA39975-1210) is shown in Figure 218 (SEQ ID
NO:514). Clone DNA39975-1210 contains a single open reading frame with an apparent translational initiation
site at nucleotide positions 375-377. The predicted polypeptide precursor is 364 amino acids long. The protein
contains four putative transmembrane domains between amino acid positions 36-55 (type II TM), 65-84, 188-208,
and 229-245, respectively. A putative N-linked glycosylation site starts at amino acid position 253. In
addition, the following features have been identified in the protein sequence: cAMP- and cGMP-dependent
protein kinase phosphorylation site, starting at position 8; N-myristoylation sites starting at positions 173 and 262,
respectively; and a ZP domain between amino acid positions 45-60. Clone DNA39975-1210 has been deposited
with ATCC (April 21, 1998) and is assigned ATCC deposit no. 209783.

EXAMPLE 96: Isolation of cDNA Clones Encoding Human PRO337

A cDNA sequence identified in the amylase screen described in Example 2 above is herein designated
DNA42301 (Figure 223, SEQ ID NO:524). The DNA42301 sequence was then compared to other EST
sequences using phrap as described in Example 1 above and a consensus sequence designated herein as
DNA28761 was identified. Based on this consensus sequence, oligonucleotides were synthesized: 1) to identify
by PCR a cDNA library that contained the sequence of interest, and 2) for use as probes to isolate a clone of
the full-length coding sequence. In order to screen several libraries for a source of a full-length clone, DNA
from the libraries was screened by PCR amplification with the PCR primer pair identified above. A positive
library was then used to isolate clones encoding the PRO337 gene using the probe oligonucleotide and one of
the PCR primers. RNA for construction of the cDNA libraries was isolated from human fetal brain.

A cDNA clone was sequenced in its entirety. The full length nucleotide sequence of DNA43316-1237
is shown in Figure 221 (SEQ ID NO:522). Clone DNA43316-1237 contains a single open reading frame with
an apparent translational initiation site at nucleotide positions 134-136 (Figure 221; SEQ ID NO:522). The
predicted polypeptide precursor is 344 amino acids long. Clone DNA43316-1237 has been deposited with
ATCC and is assigned ATCC deposit no. 209487.

Based on a BLAST-2 and FastA sequence alignment analysis of the full-length sequence, PRO337 shows
amino acid sequence identity to rat neurotrimin (^7%).

EXAMPLE 97: Isolation of cDNA Clones Encoding Human PRO403
Introduction:

Human thrombopoietin (THPO) is a glycosylated hormone of 352 amino acids consisting of two
domains. The N-terminal domain, sharing 50% similarity to erythropoietin, is responsible for the biological
activity. The C-terminal region is required for secretion. The gene for thrombopoietin (THPO) maps to human
chromosome 3q27-q28 where the six exons of this gene span 7 kilobase base pairs of genomic DNA (Chang et
al., Genomics 26: 636-7 (1995); Foster et al., Proc. Natl. Acad. Sci. USA 91: 13023-7 (1994); Gurney et al.,
Blood 85: 981-988 (1995)). In order to determine whether there were any genes encoding THPO homologues
located in close proximity to THPO, genomic DNA fragments from this region were identified and sequenced.
Three P1 clones and one PAC clone (Genome Systems Inc., St. Louis, MO; cat. Nos. P1-2535 and PAC-6539)
encompassing the THPO locus were isolated and a 140 kb region was sequenced using the ordered shotgun
strategy (Chen et al., Genomics 17: 651-656 (1993)), coupled with a PCR-based gap filling approach. Analysis
reveals that the region is gene-rich with four additional genes located very close to THPO: tumor necrosis
factor-receptor type 1 associated protein 2 (TRAP2) and elongation initiation factor gamma (eIF4G), chloride
channel 2 (CLCN2) and RNA polymerase II subunit hRPB17. While no THPO homolog was found in the region, four
novel genes have been predicted by computer-assisted gene detection (GRAIL) (Xu et al., Gen. Engin. 16:
241-253 (1994)), the presence of CpG islands (Cross, S. and Bird, A., Curr. Opin. Genet. & Devel. 5: 309-314
(1995)), and homology to known genes (as detected by WU-BLAST2.0) (Altschul and Gish, Methods Enzymol.
266: 460-480 (1996); http://blast.wustl.edu/blast/README.html).
Procedures:
P1 and PAC clones:

The initial human P1 clone was isolated from a genomic P1 library (Genome Systems Inc., St. Louis,
MO; cat. no.: P1-2535) screened with PCR primers designed from the THPO genomic sequence (A.L. Gurney,
et al., Blood 85: 981-88 (1995)). PCR primers designed from the end sequences derived from this P1 clone
were then used to screen P1 and PAC libraries (Genome Systems, Cat. Nos.: P1-2535 & PAC-6539) to identify
overlapping clones (PAC.z, P1.t, and P1.u). The 3'-end sequence from PAC.z was used to define the primers
used for the screening of a human BAC library (Genome Systems Inc., St. Louis, MO; Cat. No.: BDTW-4533A).

Ordered Shotgun Strategy:

The Ordered Shotgun Strategy (OSS) (Chen et al., Genomics 17: 651-656 (1993)) involves the mapping
and sequencing of large genomic DNA clones with a hierarchical approach. The P1 or PAC clone was sonicated
and the fragments subcloned into lambda vector (λBluestar) (Novagen, Inc., Madison, WI; cat. no. 69242-3).
The lambda subclone inserts were isolated by long-range PCR (Barnes, W., Proc. Natl. Acad. Sci. USA 91:
2216-2220 (1994)) and the ends sequenced. The lambda-end sequences were overlapped to create a partial map
of the original clone. Those lambda clones with overlapping end-sequences were identified, the inserts subcloned
into a plasmid vector (pUC18 or pUC19, Hoefer Pharmacia Biotech, Inc., San Francisco, CA, Cat. Nos.
27-4949-01 and 27-4951-01) and the ends of the plasmid subclones were sequenced and assembled to generate a
contiguous sequence. This directed sequencing strategy minimizes the redundancy required while allowing one
to scan for and concentrate on interesting regions.

In order to define better the THPO locus and to search for other genes related to the hematopoietin
family, five genomic clones were isolated from this region by PCR screening of human P1 and PAC libraries
(Genome System, Inc., Cat. Nos.: P1-2535 and PAC-6539).

The sizes of the genomic fragments are as follows: Pl.t is 40 kb; Pl.g is 70 K); Pl.u is 70 Hy; PACz is 200 

ld>; and BAC.1 is 80 kb. Approximately 75% (140 kb) of the 190 kb genomic DNA region was sequraiced by 
=^20 the Ordered Shotgun Strategy (OSS) (Chen etal.. Genomics 17: 651-56 (1993), and assembled into contigs usmg 
Ip. AutoAssemblerTM (Applied Biosystems, Perkm Ehner, Foster City, CA, cat. no. 903227). The prelimmaiy 

order of these contigs was detemuned by manual analysis. There were 47 contigs the 140 kb region. A PCR- 

based sqpproach to ordering the contigs and fining in &e gaps was enq)loyed. The following summarizes the 

number and sizes of the gaps. The 50 kb of sequence unique to BAC.1 was sequenced by a total shotgun 
25 jq^ach with a ten-fold redundancy. 

Size of gap        Number
<50 bp             13
50-150 bp          7
150-300 bp         7
300-1000 bp        10
1000-5000 bp       7
>5000 bp           2 (15,000 bp)
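
As a consistency check on the figures above, the gap table can be tallied against the stated contig count. The following is a minimal sketch only; the function names are illustrative and are not part of the disclosed method. It assumes the 47 contigs lie along a single linear region, in which case one gap is expected between each adjacent pair.

```python
# Gap counts per size class, copied from the table above.
GAP_COUNTS = {
    "<50 bp": 13,
    "50-150 bp": 7,
    "150-300 bp": 7,
    "300-1000 bp": 10,
    "1000-5000 bp": 7,
    ">5000 bp": 2,
}


def total_gaps(counts):
    """Sum the number of gaps over all size classes."""
    return sum(counts.values())


def expected_gaps(n_contigs):
    """Gaps between n contigs laid end to end along one linear region (assumption)."""
    return n_contigs - 1


def fraction_sequenced(sequenced_kb, region_kb):
    """Fraction of the region covered by the sequenced portion."""
    return sequenced_kb / region_kb


print(total_gaps(GAP_COUNTS), expected_gaps(47))
print(round(fraction_sequenced(140, 190), 2))
```

Under that assumption the table is self-consistent: the size classes sum to one fewer than the number of contigs, and 140 kb of 190 kb is close to the stated 75%.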

DNA sequencing:

ABI DYE-primer™ chemistry (PE Applied Biosystems, Foster City, CA; Cat. No.: 402112) was used
to end-sequence the lambda and plasmid subclones. ABI DYE-terminator™ chemistry (PE Applied Biosystems,
Foster City, CA, Cat. No: 403044) was used to sequence the PCR products with their respective PCR primers.
The sequences were collected with an ABI377 instrument. For PCR products larger than 1 kb, walking primers
were used. The sequences of contigs generated by the OSS strategy in AutoAssembler™ (PE Applied
Biosystems, Foster City, CA; Cat. No: 903227) and the gap-filling sequencing trace files were imported into
Sequencher™ (Gene Codes Corp., Ann Arbor, MI) for overlapping and editing. The sequences generated by
the total shotgun strategy were assembled using Phred and Phrap and edited using Consed
(http://chimera.biotech.washington.edu/uwgc/projects.htm) and GFP (Genome Reconstruction Manager for
Phrap), version 1.2 (http://stork.cellb.bcm.tmc.edu/gfp/).
PCR-Based Gap Filling Strategy:

Primers were designed based on the 5'- and 3'-end sequences of each contig, avoiding repetitive and
low quality sequence regions. All primers were designed to be 19-24-mers with 50-70% G/C content. Oligos
were synthesized and gel-purified by standard methods.
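
The two stated design rules (length of 19-24 nucleotides and 50-70% G/C content) can be expressed as a simple filter. This is a sketch under stated assumptions: the function names are hypothetical, the avoidance of repetitive and low-quality regions is not modelled, and the sample input is the ECE2.f primer (SEQ ID NO:531) listed later in this example, used here only as a convenient test sequence rather than as a gap-filling primer.

```python
def gc_fraction(seq):
    """Fraction of G and C bases in a nucleotide sequence."""
    seq = seq.upper()
    return (seq.count("G") + seq.count("C")) / len(seq)


def meets_design_rules(seq, min_len=19, max_len=24, min_gc=0.50, max_gc=0.70):
    """True if the primer is a 19-24-mer with 50-70% G/C content."""
    return min_len <= len(seq) <= max_len and min_gc <= gc_fraction(seq) <= max_gc


sample = "ACGCAGCTGGAGCTGGTCTTAGCA"  # ECE2.f, used only as sample input
print(len(sample), round(gc_fraction(sample), 3), meets_design_rules(sample))
```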
Since the orientation and order of the contigs were unknown, permutations of the primers were used
in the amplification reactions. Two PCR kits were used: first, XL PCR kit (Perkin Elmer, Norwalk, CT; Cat.
No.: N8080205), with extension times of approximately 10 minutes; and second, the Taq polymerase PCR kit
(Qiagen Inc., Valencia, CA; Cat. No.: 201223) was used under high stringency conditions if smeared or multiple
products were observed with the XL PCR kit. The main PCR product from each successful reaction was
extracted from a 0.9% low melting agarose gel and purified with the Geneclean DNA Purification kit prior to
sequencing.
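
The text states only that permutations of the primers were used; the exact pairing scheme is not given. One plausible reading, sketched below as an assumption, is that each contig contributes one primer per end and every pair of primers from different contigs is a candidate reaction. The names used are hypothetical.

```python
from itertools import combinations


def candidate_primer_pairs(contig_names):
    """All pairs of end primers drawn from two different contigs.

    Each contig is assumed to contribute one primer at its 5' end and one
    at its 3' end; pairs within the same contig are skipped because they
    cannot bridge a gap between contigs.
    """
    ends = [(contig, end) for contig in contig_names for end in ("5p", "3p")]
    return [(a, b) for a, b in combinations(ends, 2) if a[0] != b[0]]


contigs = ["contig%d" % i for i in range(1, 48)]  # 47 contigs, as stated above
print(len(candidate_primer_pairs(contigs)))
```

For n contigs this gives 2n(n-1) candidate pairs, which illustrates why a preliminary manual ordering of the contigs is useful before exhaustive pairing.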
Analysis:

The identification and characterization of coding regions was carried out as follows: First, repetitive
sequences were masked using RepeatMasker (A.F.A. Smit & P. Green,
http://ftp.genome.washington.edu/RM/RM_details.html) which screens DNA sequences in FastA format against
a library of repetitive elements and returns a masked query sequence. Repeats not masked were identified by
comparing the sequence to the GenBank database using WUBLAST2.0 [Altschul, S & Gish, W., Methods
Enzymol. 266: 460-480 (1996); http://blast.wustl.edu/blast/README.html] and were masked manually.
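
The manual masking step can be pictured as replacing known repeat intervals with a placeholder base. The sketch below is illustrative only and does not reproduce RepeatMasker; the function name, the 0-based end-exclusive coordinate convention, and the use of "N" as the mask character are assumptions.

```python
def mask_intervals(seq, intervals, mask_char="N"):
    """Replace each 0-based, end-exclusive (start, end) interval with mask_char."""
    chars = list(seq)
    for start, end in intervals:
        # Clamp to the sequence bounds so out-of-range intervals are harmless.
        for i in range(max(start, 0), min(end, len(chars))):
            chars[i] = mask_char
    return "".join(chars)


print(mask_intervals("ACGTACGTAC", [(2, 5)]))
```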

Next, known genes were revealed by comparing the genomic regions against Genentech's protein
database using the WUBLAST2.0 algorithm and then annotated by aligning the genomic and cDNA sequences
for each gene, respectively, using a Needleman-Wunsch (Needleman and Wunsch, J. Mol. Biol. 48: 443-453
(1970)) algorithm to find regions of local identity between sequences. The strategy results in detection of all
exons of the five known genes in the region, THPO, TRAP2, eIF4g, CLCN2 and hRPB17 (see below).

Known genes                                          Map position
eukaryotic translation initiation factor 4 gamma     3q27-qter
thrombopoietin                                       3q26-q27
chloride channel 2                                   3q26-qter
TNF receptor associated protein 2                    not previously
RNA polymerase II subunit hRPB17                     not previously
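
For reference, the Needleman-Wunsch method cited above for the genomic-to-cDNA alignment is a dynamic-programming algorithm that scores an end-to-end (global) alignment of two sequences. A minimal score-only sketch follows. The scoring values (match +1, mismatch -1, linear gap -2) are assumptions chosen for illustration; the source does not state the parameters actually used, and a practical genomic-to-cDNA alignment would also need to tolerate long intron-sized gaps.

```python
def needleman_wunsch_score(a, b, match=1, mismatch=-1, gap=-2):
    """Optimal global alignment score of a and b with a linear gap penalty."""
    # prev[j] holds the best score for aligning a[:i-1] with b[:j].
    prev = [j * gap for j in range(len(b) + 1)]
    for i in range(1, len(a) + 1):
        curr = [i * gap]
        for j in range(1, len(b) + 1):
            diag = prev[j - 1] + (match if a[i - 1] == b[j - 1] else mismatch)
            curr.append(max(diag, prev[j] + gap, curr[j - 1] + gap))
        prev = curr
    return prev[-1]


print(needleman_wunsch_score("ACGT", "AGT"))
```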

Finally, novel transcription units were predicted using a number of approaches. CpG islands (S. Cross
& Bird, A., Curr. Opin. Genet. Dev. 5: 109-314 (1995)) were used to define promoter regions and were
identified as clusters of sites cleaved by enzymes recognizing GC-rich, 6 or 8-mer palindromic sequences (NotI,
NarI, BssHII, XhoI). CpG islands are usually associated with promoter regions of genes. WUBLAST2.0
analysis of short genomic regions (10-20 kb) versus GenBank revealed matches to ESTs. The individual EST
sequences (or where possible, their sequence chromatogram files) were retrieved and assembled with Sequencher
to provide a theoretical cDNA sequence (DNA36443). GRAIL2 (ApoCom Inc., Knoxville, TN, command line
version for the DEC alpha) was used to predict a novel exon. The five known genes in the region served as
internal controls for the success of the GRAIL algorithm.
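
The CpG island criterion described above (clusters of recognition sites for NotI, NarI, BssHII and XhoI) can be sketched as a site search followed by a proximity grouping. The recognition sequences are the standard ones for these enzymes; the window size and minimum number of sites per cluster are hypothetical values, since the source does not state the thresholds used.

```python
import re

# Standard recognition sequences of the enzymes named in the text.
RARE_CUTTER_SITES = {
    "NotI": "GCGGCCGC",
    "NarI": "GGCGCC",
    "BssHII": "GCGCGC",
    "XhoI": "CTCGAG",
}


def find_sites(seq):
    """Return sorted (position, enzyme) hits; positions are 0-based."""
    seq = seq.upper()
    hits = []
    for enzyme, site in RARE_CUTTER_SITES.items():
        # A lookahead pattern reports overlapping occurrences as well.
        for m in re.finditer("(?=" + site + ")", seq):
            hits.append((m.start(), enzyme))
    return sorted(hits)


def cluster_sites(hits, window=1000, min_sites=3):
    """Group consecutive hits lying within `window` bp of the previous hit.

    `window` and `min_sites` are illustrative thresholds (assumptions).
    """
    clusters, current = [], []
    for pos, enzyme in hits:
        if current and pos - current[-1][0] > window:
            if len(current) >= min_sites:
                clusters.append(current)
            current = []
        current.append((pos, enzyme))
    if len(current) >= min_sites:
        clusters.append(current)
    return clusters


demo = "AAAA" + "GCGGCCGC" + "TT" + "GGCGCC" + "TT" + "GCGCGC" + "A" * 20
print(find_sites(demo))
print(len(cluster_sites(find_sites(demo))))
```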
Isolation:

A partial endothelin converting enzyme-2 (ECE-2) cDNA clone was isolated by first splicing in silico
the ECE-2 exons predicted in the genomic sequence to generate a putative sequence (DNA36443). An
oligonucleotide probe (GAAGCAGTGCAGCCAGCAGTAGAGAGGCACCTGCTAAGA) (SEQ ID NO:530)
was designed and used to screen a human fetal small intestine library (LIB110) and internal PCR primers
(36443f1) (ECE2.f: ACGCAGCTGGAGCTGGTCTTAGCA) (SEQ ID NO:531) and (36443r1) (ECE2.r:
GGTACTGGACCCCTAGGGCCACAA) (SEQ ID NO:532) were used to confirm clones hybridizing to the
probe prior to sequencing. One positive clone was obtained; however, this cDNA (DNA49830) represented a
partially spliced transcript containing appropriately spliced exons 1 through 6, followed by intron 6 sequence.
The oligo dT primer annealed to a polyA-stretch within an Alu element present in intron 6. An additional ECE-2
cDNA fragment (DNA49831) was obtained by PCR from a human fetal kidney library (LIB227) with primers
designed from the presumed cDNA sequence [36443f3: CCTCCCAGCCGAGACCAGTGG (SEQ ID NO:533)
and 36443r2: GGTCCTATAAGGGCCAAGACC (SEQ ID NO:534)]. This PCR product extended from exon
13 into the 3' untranslated region in exon 18.
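
Splicing in silico, as used above to derive the putative sequence, amounts to concatenating the predicted exon intervals in genomic order. The sketch below is illustrative only: the function name, the 1-based inclusive coordinate convention, the forward-strand assumption and the toy sequence are all hypothetical and do not correspond to the actual ECE-2 exon coordinates.

```python
def splice_in_silico(genomic, exons):
    """Concatenate exon intervals (1-based, inclusive) in genomic order.

    Assumes all exons lie on the forward strand and do not overlap.
    """
    ordered = sorted(exons)
    for (_, end1), (start2, _) in zip(ordered, ordered[1:]):
        if start2 <= end1:
            raise ValueError("exon intervals overlap")
    return "".join(genomic[start - 1:end] for start, end in ordered)


# Toy example: two exons (upper case) separated by one intron (lower case).
toy_genomic = "ATGGCC" + "gtaagtcag" + "GGTTAA"
print(splice_in_silico(toy_genomic, [(16, 21), (1, 6)]))
```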

A full length endothelin converting enzyme 2 (ECE-2) cDNA clone (DNA55800-1263) was isolated
from an oligo-dT-primed human fetal brain library. RNA from human fetal brain tissue (20 weeks gestation,
#28300S)(SRC17S) was isolated by guanidine thiocyanate and 5 µg used to generate double stranded cDNA
which was cloned into the vector pRK5E. The 3'-primer
(pGACTAGTTCTAGATCGCGAGCGGCCGCCCTTTTTrri-lTilTTT) (SEQ ID NO:535) and the 5'-linker
(pCGGACGCGTGGGTCGA) (SEQ ID NO:536) were designed to introduce XhoI and NotI restriction sites.
The library was screened with PCR primers [36443pcrf1: CGGCCGTGATGGCTGGTGACG (SEQ ID NO:537)
and 36443r3: GGCAGACTCCTTCCTATGGG (SEQ ID NO:538)] designed from the partial human ECE-2
cDNA sequences (DNA49830 and DNA49831). PCR products were cloned into the vector pCR2.1-TOPO
(Invitrogen Corp., Carlsbad, CA, Cat. No. K4500-01) and sequenced with DYE-terminator chemistry as
described above.

EXAMPLE 98: Northern Blot and in situ RNA Hybridization Analysis for PRO403

Expression of PRO403 mRNA in human tissues was examined by Northern blot analysis. Human
polyA+ RNA blots derived from human fetal and adult tissues (Clontech, Palo Alto, CA; Cat. Nos. 7760-1,
7756-1 and 7755-1) were hybridized to [32P-α]dATP-labelled cDNA fragments from a probe based on the full
length PRO403 cDNA. Blots were incubated with the probes in hybridization buffer (5X SSPE; 2X Denhardt's
solution; 100 mg/mL denatured sheared salmon sperm DNA; 50% formamide; 2% SDS) for 18 hours at 42°C,
washed to high stringency (0.1X SSC, 0.1% SDS, 50°C) and autoradiographed. The blots were developed after
overnight exposure by phosphorimager analysis (Fuji).

PRO403 mRNA transcripts were detected. Analysis of the expression pattern showed the strongest
signal of the expected 3.3 kb transcript in adult brain (highest in the cerebellum, putamen, medulla, and temporal
lobe, and lower in the cerebral cortex, occipital lobe and frontal lobe), spinal cord, lung and pancreas and higher
levels of a 4.5 kb transcript in fetal brain and kidney.

EXAMPLE 99: Use of PRO Polypeptide-Encoding Nucleic Acid as Hybridization Probes

The following method describes use of a nucleotide sequence encoding a PRO polypeptide as a
hybridization probe.

DNA comprising the coding sequence of a PRO polypeptide of interest as disclosed herein may be
employed as a probe or used as a basis from which to prepare probes to screen for homologous DNAs (such as
those encoding naturally-occurring variants of the PRO polypeptide) in human tissue cDNA libraries or human
tissue genomic libraries.

Hybridization and washing of filters containing either library DNAs is performed under the following
high stringency conditions. Hybridization of radiolabeled PRO polypeptide-encoding nucleic acid-derived probe
to the filters is performed in a solution of 50% formamide, 5x SSC, 0.1% SDS, 0.1% sodium pyrophosphate,
50 mM sodium phosphate, pH 6.8, 2x Denhardt's solution, and 10% dextran sulfate at 42°C for 20 hours.
Washing of the filters is performed in an aqueous solution of 0.1x SSC and 0.1% SDS at 42°C.

DNAs having a desired sequence identity with the DNA encoding full-length native sequence PRO
polypeptide can then be identified using standard techniques known in the art.

EXAMPLE 100: Expression of PRO Polypeptides in E. coli

This example illustrates preparation of an unglycosylated form of a desired PRO polypeptide by
recombinant expression in E. coli.

The DNA sequence encoding the desired PRO polypeptide is initially amplified using selected PCR
primers. The primers should contain restriction enzyme sites which correspond to the restriction enzyme sites
on the selected expression vector. A variety of expression vectors may be employed. An example of a suitable
vector is pBR322 (derived from E. coli; see Bolivar et al., Gene, 2:95 (1977)) which contains genes for
ampicillin and tetracycline resistance. The vector is digested with restriction enzyme and dephosphorylated.
The PCR amplified sequences are then ligated into the vector. The vector will preferably include sequences
which encode for an antibiotic resistance gene, a trp promoter, a polyhis leader (including the first six STII
codons, polyhis sequence, and enterokinase cleavage site), the specific PRO polypeptide coding region, lambda
transcriptional terminator, and an argU gene.

The ligation mixture is then used to transform a selected E. coli strain using the methods described in
Sambrook et al., supra. Transformants are identified by their ability to grow on LB plates and antibiotic resistant
colonies are then selected. Plasmid DNA can be isolated and confirmed by restriction analysis and DNA
sequencing.

Selected clones can be grown overnight in liquid culture medium such as LB broth supplemented with
antibiotics. The overnight culture may subsequently be used to inoculate a larger scale culture. The cells are
then grown to a desired optical density, during which the expression promoter is turned on.

After culturing the cells for several more hours, the cells can be harvested by centrifugation. The cell
pellet obtained by the centrifugation can be solubilized using various agents known in the art, and the solubilized
PRO polypeptide can then be purified using a metal chelating column under conditions that allow tight binding
of the protein.

PRO181, PRO195, PRO200, PRO237, PRO273, PRO540, PRO322, PRO1017, PRO938, PRO162,
PRO1114, PRO827 and PRO1008 were expressed in E. coli in a poly-His tagged form, using the following
procedure. The DNA encoding the PRO polypeptide was initially amplified using selected PCR primers. The
primers contained restriction enzyme sites which correspond to the restriction enzyme sites on the selected
expression vector, and other useful sequences providing for efficient and reliable translation initiation, rapid
purification on a metal chelation column, and proteolytic removal with enterokinase. The PCR-amplified, poly-
His tagged sequences were then ligated into an expression vector, which was used to transform an E. coli host
based on strain 52 (W3110 fuhA(tonA) lon galE rpoHts(htpRts) clpP(lacIq)). Transformants were first grown
in LB containing 50 mg/ml carbenicillin at 30°C with shaking until an O.D.600 of 3-5 was reached. Cultures
were then diluted 50-100 fold into CRAP media (prepared by mixing 3.57 g (NH4)2SO4, 0.71 g sodium
citrate-2H2O, 1.07 g KCl, 5.36 g Difco yeast extract, 5.36 g Sheffield hycase SF in 500 mL water, as well as
110 mM MPOS, pH 7.3, 0.55% (w/v) glucose and 7 mM MgSO4) and grown for approximately 20-30 hours
at 30°C with shaking. Samples were removed to verify expression by SDS-PAGE analysis, and the bulk culture
was centrifuged to pellet the cells. Cell pellets were frozen until purification and refolding.
E. coli paste from 0.5 to 1 L fermentations (6-10 g pellets) was resuspended in 10 volumes (w/v) in 7
M guanidine, 20 mM Tris, pH 8 buffer. Solid sodium sulfite and sodium tetrathionate were added to make final
concentrations of 0.1 M and 0.02 M, respectively, and the solution was stirred overnight at 4°C. This step
results in a denatured protein with all cysteine residues blocked by sulfitolization. The solution was centrifuged
at 40,000 rpm in a Beckman Ultracentrifuge for 30 min. The supernatant was diluted with 3-5 volumes of metal
chelate column buffer (6 M guanidine, 20 mM Tris, pH 7.4) and filtered through 0.22 micron filters to clarify.
The clarified extract was loaded onto a 5 ml Qiagen Ni-NTA metal chelate column equilibrated in the
metal chelate column buffer. The column was washed with additional buffer containing 50 mM imidazole
(Calbiochem, Utrol grade), pH 7.4. The protein was eluted with buffer containing 250 mM imidazole.
Fractions containing the desired protein were pooled and stored at 4°C. Protein concentration was estimated
by its absorbance at 280 nm using the calculated extinction coefficient based on its amino acid sequence.
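
The concentration estimate described above follows the Beer-Lambert relation, with the extinction coefficient computed from the numbers of tryptophan, tyrosine and cystine residues. The sketch below is an assumption about how such a calculation may be carried out: it uses the commonly cited per-residue values of 5500, 1490 and 125 M^-1 cm^-1, which the source does not specify, and it defaults to zero cystines because the cysteines are blocked at this stage. The function names and the sample sequence are hypothetical.

```python
def extinction_coefficient_280(seq, n_cystines=0):
    """Estimated molar extinction coefficient at 280 nm (M^-1 cm^-1).

    Per-residue values (Trp 5500, Tyr 1490, cystine 125) are commonly used
    literature values and are an assumption here.
    """
    seq = seq.upper()
    return 5500 * seq.count("W") + 1490 * seq.count("Y") + 125 * n_cystines


def protein_concentration_molar(a280, ext_coeff, path_cm=1.0):
    """Beer-Lambert law: concentration = A / (epsilon * path length)."""
    if ext_coeff <= 0:
        raise ValueError("extinction coefficient must be positive")
    return a280 / (ext_coeff * path_cm)


eps = extinction_coefficient_280("MWYWC")  # toy sequence, not a PRO polypeptide
print(eps, protein_concentration_molar(0.6245, eps))
```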

The proteins were refolded by diluting sample slowly into freshly prepared refolding buffer consisting
of: 20 mM Tris, pH 8.6, 0.3 M NaCl, 2.5 M urea, 5 mM cysteine, 20 mM glycine and 1 mM EDTA.
Refolding volumes were chosen so that the final protein concentration was between 50 to 100 micrograms/ml.
The refolding solution was stirred gently at 4°C for 12-36 hours. The refolding reaction was quenched by the
addition of TFA to a final concentration of 0.4% (pH of approximately 3). Before further purification of the
protein, the solution was filtered through a 0.22 micron filter and acetonitrile was added to 2-10% final
concentration. The refolded protein was chromatographed on a Poros R1/H reversed phase column using a
mobile buffer of 0.1% TFA with elution with a gradient of acetonitrile from 10 to 80%. Aliquots of fractions
with A280 absorbance were analyzed on SDS polyacrylamide gels and fractions containing homogeneous
refolded protein were pooled. Generally, the properly refolded species of most proteins are eluted at the lowest
concentrations of acetonitrile since those species are the most compact with their hydrophobic interiors shielded
from interaction with the reversed phase resin. Aggregated species are usually eluted at higher acetonitrile
concentrations. In addition to resolving misfolded forms of proteins from the desired form, the reversed phase
step also removes endotoxin from the samples.
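
The rule stated above for choosing refolding volumes (a final protein concentration of 50 to 100 micrograms/ml) translates directly into a range of final volumes for a given mass of protein. The following is a minimal sketch; the function names and the 10 mg example are hypothetical.

```python
def refolding_volume_ml(protein_mg, target_ug_per_ml):
    """Final volume (ml) giving the target concentration for a protein mass."""
    return protein_mg * 1000.0 / target_ug_per_ml


def refolding_volume_range_ml(protein_mg, low_ug_per_ml=50.0, high_ug_per_ml=100.0):
    """(minimum, maximum) final volume for the 50-100 microgram/ml window."""
    return (
        refolding_volume_ml(protein_mg, high_ug_per_ml),
        refolding_volume_ml(protein_mg, low_ug_per_ml),
    )


print(refolding_volume_range_ml(10.0))  # hypothetical 10 mg of pooled protein
```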

Fractions containing the desired folded PRO proteins were pooled and the acetonitrile removed using
a gentle stream of nitrogen directed at the solution. Proteins were formulated into 20 mM Hepes, pH 6.8 with
0.14 M sodium chloride and 4% mannitol by dialysis or by gel filtration using G25 Superfine (Pharmacia) resins
equilibrated in the formulation buffer and sterile filtered.

Many of the PRO polypeptides described herein were successfully expressed as described above.

EXAMPLE 101: Expression of PRO Polypeptides in Mammalian Cells

This example illustrates preparation of a glycosylated form of a desired PRO polypeptide by
recombinant expression in mammalian cells.

The vector, pRK5 (see EP 307,247, published March 15, 1989), is employed as the expression vector.
Optionally, the PRO polypeptide-encoding DNA is ligated into pRK5 with selected restriction enzymes to allow
insertion of the PRO polypeptide DNA using ligation methods such as described in Sambrook et al., supra. The
resulting vector is called pRK5-PRO polypeptide.

In one embodiment, the selected host cells may be 293 cells. Human 293 cells (ATCC CCL 1573) are
grown to confluence in tissue culture plates in medium such as DMEM supplemented with fetal calf serum and
optionally, nutrient components and/or antibiotics. About 10 µg pRK5-PRO polypeptide DNA is mixed with
about 1 µg DNA encoding the VA RNA gene [Thimmappaya et al., Cell, 31:543 (1982)] and dissolved in 500
µl of 1 mM Tris-HCl, 0.1 mM EDTA, 0.227 M CaCl2. To this mixture is added, dropwise, 500 µl of 50 mM
HEPES (pH 7.35), 280 mM NaCl, 1.5 mM NaPO4, and a precipitate is allowed to form for 10 minutes at 25°C.
The precipitate is suspended and added to the 293 cells and allowed to settle for about four hours at 37°C. The
culture medium is aspirated off and 2 ml of 20% glycerol in PBS is added for 30 seconds. The 293 cells are
then washed with serum free medium, fresh medium is added and the cells are incubated for about 5 days.

Approximately 24 hours after the transfections, the culture medium is removed and replaced with culture
medium (alone) or culture medium containing 200 µCi/ml 35S-cysteine and 200 µCi/ml 35S-methionine. After
a 12 hour incubation, the conditioned medium is collected, concentrated on a spin filter, and loaded onto a 15%
SDS gel. The processed gel may be dried and exposed to film for a selected period of time to reveal the
presence of PRO polypeptide. The cultures containing transfected cells may undergo further incubation (in
serum free medium) and the medium is tested in selected bioassays.

In an alternative technique, PRO polypeptide may be introduced into 293 cells transiently using the
dextran sulfate method described by Somparyrac et al., Proc. Natl. Acad. Sci., 12:7575 (1981). 293 cells are
grown to maximal density in a spinner flask and 700 µg pRK5-PRO polypeptide DNA is added. The cells are
first concentrated from the spinner flask by centrifugation and washed with PBS. The DNA-dextran precipitate
In order to screen several libraries for a source of a full-length clone, DNA from the libraries was
screened by PCR amplification with the PCR primer pair identified above. A positive library was then used to
isolate clones encoding the PRO213-1, PRO1330 and/or PRO1449 gene using the probe oligonucleotide and one
of the PCR primers. RNA for construction of the cDNA libraries was isolated from human fetal lung tissue.

DNA sequencing of the clones isolated as described above gave the full-length DNA sequence encoding
PRO213-1, PRO1330 and/or PRO1449 [DNA30943-1-1163-1 (SEQ ID NO:505), DNA64907-1163-1 (SEQ ID
NO:507) and DNA64908-1163-1 (SEQ ID NO:509), respectively].

The entire nucleotide sequences corresponding to DNA30943-1-1163-1 (SEQ ID NO:505), DNA64907-
1163-1 (SEQ ID NO:507) and DNA64908-1163-1 (SEQ ID NO:509), respectively. DNA30943-1163,
DNA64907-1163-1 and DNA64908-1163-1 contain a single open reading frame with an apparent translational
initiation site at nucleotide positions 336-338, 488-490 and 326-328, respectively, and ending at the stop codon
at nucleotide positions 1221-1223, 1307-1309 and 1145-1147, respectively (Figures 212, 214 and 216). The
predicted polypeptide precursor is 295, 273 and 273 amino acids long, respectively (Figures 213, 215 and 217).
DNA30943-1-1163-1, DNA64907-1163-1 and DNA64908-1163-1 have been deposited with ATCC and are
assigned ATCC deposit no. 209791, 203242 and 203243, respectively.
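
The stated precursor lengths can be checked against the stated initiation and stop codon positions, since the coding region runs from the first nucleotide of the initiation codon up to, but not including, the stop codon. The following is a minimal sketch; the function name is illustrative.

```python
def orf_length_aa(start_first_nt, stop_first_nt):
    """Amino acids encoded between an initiation codon and a stop codon.

    Both arguments are the position of the first nucleotide of the codon;
    the stop codon itself is not translated.
    """
    coding_nt = stop_first_nt - start_first_nt
    if coding_nt <= 0 or coding_nt % 3 != 0:
        raise ValueError("start and stop codons are not in the same frame")
    return coding_nt // 3


# Positions as stated above for the three clones.
for start, stop in [(336, 1221), (488, 1307), (326, 1145)]:
    print(start, stop, orf_length_aa(start, stop))
```

The three results agree with the stated lengths of 295, 273 and 273 amino acids.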

Analysis of the amino acid sequence of the full-length PRO213-1 polypeptide suggests that a portion
of it possesses significant homology to the human growth arrest-specific gene 6 protein. More specifically, an
analysis of the Dayhoff database (version 35.45 SwissProt 35) evidenced significant homology between the
PRO213 amino acid sequence and the following Dayhoff sequences, HSMHC3W5A_6 and B48089.

Additional analysis of the amino acid sequence of the full-length PRO1330 and PRO1449 polypeptide
indicates significant identity with notch4. More specifically, an analysis of the Dayhoff database (version 35.130
SwissProt 35) evidenced significant identity between PRO1330 and the following Dayhoff sequences, D86566_1
and NEL_HUMAN.

EXAMPLE 95: Isolation of cDNA Clones Encoding Human PRO298

A cDNA isolated in the amylase screen described in Example 2 above is herein designated DNA26832
(Figure 220; SEQ ID NO:516). The sequence of DNA26832 was then used to search expressed sequence tag
(EST) databases. The EST databases included public EST databases (e.g., GenBank) and a proprietary EST
database (LIFESEQ™, Incyte Pharmaceuticals, Palo Alto, CA). The search was performed using the computer
program BLAST or BLAST2 (Altschul et al., Methods in Enzymology 266: 469-480 [1996]). Those comparisons
resulting in a BLAST score of 70 (or in some cases 90) or greater that did not encode known proteins were clustered
and assembled into consensus DNA sequences with the program "phrap" (Phil Green, University of Washington,
Seattle, Washington; http://bozeman.mbt.washington.edu/phrap.docs/phrap.html).

A consensus DNA sequence was assembled relative to other EST sequences using phrap. A consensus
sequence was determined, which was then extended using repeated cycles of BLAST and phrap to extend the
consensus sequence as far as possible using the sources of EST sequences discussed above. The extended
assembly sequence was designated DNA35861. Based on the DNA35861 consensus sequence, oligonucleotides
were synthesized: 1) to identify by PCR a cDNA library that contained the sequence of interest, and 2) for use
as probes to isolate a clone of the full-length coding sequence of PRO298. Forward and reverse primers
generally range from 20 to 30 nucleotides and are often designed to give a PCR product of about 100-1000 bp
in length. The probe sequence is typically 40-55 bp in length. In some cases, additional oligonucleotides are
synthesized when the consensus sequence is greater than about 1-1.5 kbp. In order to screen several libraries
for a full-length clone, DNA from the libraries was screened by PCR amplification, as per Ausubel et al.,
Current Protocols in Molecular Biology, with the PCR primer pair. A positive library was used to isolate clones
encoding the gene of interest using the probe oligonucleotide and one of the primer pairs.

PCR primers (forward and reverse) and a hybridization probe were synthesized:
forward PCR primer 1   CAAGGTGATTTGAAAGGTGGGGTG (SEQ ID NO:517)
forward PCR primer 2   GGCrGGTATGAAGAATITCG (SEQ ID NO:518)
forward PCR primer 3   AGTGGAAGTGGAGGTGCG (SEQ ID NO:519)
reverse PCR primer 1   GTGACGTGAAATGTCTGATAGGGG (SEQ ID NO:520)
hybridization probe 1  GGGAAAAGGCATTTTGGGAGCAGGAATTGGAATCATGTCTGTGATGGTGG (SEQ ID NO:521)

In order to screen several libraries for a source of a full-length clone, DNA from the libraries was
screened by PCR amplification with the PCR primer pair identified above. A positive library was then used to
isolate clones encoding the PRO298 gene using the probe oligonucleotide and one of the PCR primers.

RNA for construction of the cDNA libraries was isolated from human fetal lung tissue (LIB25). The
cDNA libraries used to isolate the cDNA clones were constructed by standard methods using commercially
available reagents such as those from Invitrogen, San Diego, CA. The cDNA was primed with oligo dT
containing a NotI site, linked with blunt to SalI hemikinased adaptors, cleaved with NotI, sized appropriately
by gel electrophoresis, and cloned in a defined orientation into a suitable cloning vector (such as pRKB or
pRKD; pRK5B is a precursor of pRK5D that does not contain the SfiI site; see, Holmes et al., Science,
253:1278-1280 (1991)) in the unique XhoI and NotI sites.

DNA sequencing of the clones isolated as described above gave the full-length DNA sequence for
PRO298 (herein designated UNQ261 [DNA39975-1210]) (SEQ ID NO:514), and the derived protein sequence
for PRO298 (SEQ ID NO:515).

The entire nucleotide sequence of UNQ261 (DNA39975-1210) is shown in Figure 218 (SEQ ID
NO:514). Clone DNA39975-1210 contains a single open reading frame with an apparent translational initiation
site at nucleotide positions 375-377. The predicted polypeptide precursor is 364 amino acids long. The protein
contains four putative transmembrane domains between amino acid positions 36-55 (type II TM), 65-84, 188-
208, and 229-245, respectively. A putative N-linked glycosylation site starts at amino acid position 253. In
addition, the following features have been identified in the protein sequence: cAMP- and cGMP-dependent
protein kinase phosphorylation site, starting at position 8; N-myristoylation sites starting at position 173 and 262,
respectively; and a ZP domain between amino acid positions 45-60. Clone DNA39975-1210 has been deposited
with ATCC (April 21, 1998) and is assigned ATCC deposit no. 209783.

EXAMPLE 96: Isolation of cDNA Clones Encoding Human PRO337

A cDNA sequence identified in the amylase screen described in Example 2 above is herein designated
DNA42301 (Figure 223, SEQ ID NO:524). The DNA42301 sequence was then compared to other EST
sequences using phrap as described in Example 1 above and a consensus sequence designated herein as
DNA28761 was identified. Based on this consensus sequence, oligonucleotides were synthesized: 1) to identify
by PCR a cDNA library that contained the sequence of interest, and 2) for use as probes to isolate a clone of
the full-length coding sequence. In order to screen several libraries for a source of a full-length clone, DNA
from the libraries was screened by PCR amplification with the PCR primer pair identified above. A positive
library was then used to isolate clones encoding the PRO337 gene using the probe oligonucleotide and one of
the PCR primers. RNA for construction of the cDNA libraries was isolated from human fetal brain.

A cDNA clone was sequenced in its entirety. The full length nucleotide sequence of DNA43316-1237
is shown in Figure 221 (SEQ ID NO:522). Clone DNA43316-1237 contains a single open reading frame with
an apparent translational initiation site at nucleotide positions 134-136 (Figure 221; SEQ ID NO:522). The
predicted polypeptide precursor is 344 amino acids long. Clone DNA43316-1237 has been deposited with
ATCC and is assigned ATCC deposit no. 209487.

Based on a BLAST-2 and FastA sequence alignment analysis of the full-length sequence, PRO337 shows
amino acid sequence identity to rat neurotrimin (97%).

EXAMPLE 97: Isolation of cDNA Clones Encoding Human PRO403
Introduction:

Human thrombopoietin (THPO) is a glycosylated hormone of 352 amino acids consisting of two
domains. The N-terminal domain, sharing 50% similarity to erythropoietin, is responsible for the biological
activity. The C-terminal region is required for secretion. The gene for thrombopoietin (THPO) maps to human
chromosome 3q27-q28 where the six exons of this gene span 7 kilobase base pairs of genomic DNA (Chang et
al., Genomics 26: 636-7 (1995); Foster et al., Proc. Natl. Acad. Sci. USA 91: 13023-7 (1994); Gurney et al.,
Blood 85: 981-988 (1995)). In order to determine whether there were any genes encoding THPO homologues
located in close proximity to THPO, genomic DNA fragments from this region were identified and sequenced.
Three P1 clones and one PAC clone (Genome Systems Inc., St. Louis, MO; cat. Nos. P1-2535 and PAC-6539)
encompassing the THPO locus were isolated and a 140 kb region was sequenced using the ordered shotgun
strategy (Chen et al., Genomics 17: 651-656 (1993)), coupled with a PCR-based gap filling approach. Analysis
reveals that the region is gene-rich with four additional genes located very close to THPO: tumor necrosis factor-
receptor type 1 associated protein 2 (TRAP2) and elongation initiation factor gamma (eIF4g), chloride channel
2 (CLCN2) and RNA polymerase II subunit hRPB17. While no THPO homolog was found in the region, four
novel genes have been predicted by computer-assisted gene detection (GRAIL) (Xu et al., Gen. Engin. 16: 241-
253 (1994)), the presence of CpG islands (Cross, S. and Bird, A., Curr. Opin. Genet. & Devel. 5: 109-314
(1995)), and homology to known genes (as detected by WU-BLAST2.0) (Altschul and Gish, Methods Enzymol.
266: 460-480 (1996)) (http://blast.wustl.edu/blast/README.html).
Procedures:
P1 and PAC clones:

The initial human P1 clone was isolated from a genomic P1 library (Genome Systems Inc., St. Louis,
MO; cat. no.: P1-2535) screened with PCR primers designed from the THPO genomic sequence (A.L. Gurney,
et al., Blood 85: 981-88 (1995)). PCR primers designed from the end sequences derived from this P1 clone
were then used to screen P1 and PAC libraries (Genome Systems, Cat. Nos.: P1-2535 & PAC-6539) to identify
overlapping clones (PAC1, P1.t, and P1.u). The 3'-end sequence from PACz was used to define the primers
used for the screening of a human BAC library (Genome Systems Inc., St. Louis, MO; Cat. No.: BDTW-4533A).

Ordered Shotgun Strategy:

The Ordered Shotgun Strategy (OSS) (Chen et al., Genomics 17: 651-656 (1993)) involves the mapping
and sequencing of large genomic DNA clones with a hierarchical approach. The P1 or PAC clone was sonicated
and the fragments subcloned into lambda vector (λBluestar) (Novagen, Inc., Madison, WI; cat. no. 69242-3).
The lambda subclone inserts were isolated by long-range PCR (Barnes, W., Proc. Natl. Acad. Sci. USA 91:
2216-2220 (1994)) and the ends sequenced. The lambda-end sequences were overlapped to create a partial map
of the original clone. Those lambda clones with overlapping end-sequences were identified, the inserts subcloned
into a plasmid vector (pUC18 or pUC19, Hoefer Pharmacia Biotech, Inc., San Francisco, CA, Cat. Nos.
27-4949-01 and 27-4951-01) and the ends of the plasmid subclones were sequenced and assembled to generate a
contiguous sequence. This directed sequencing strategy minimizes the redundancy required while allowing one
to scan for and concentrate on interesting regions.

In order to define better the THPO locus and to search for other genes related to the hematopoietin
family, five genomic clones were isolated from this region by PCR screening of human P1 and PAC libraries
(Genome Systems, Inc., Cat. Nos.: P1-2535 and PAC-6539).

The sizes of the genomic fragments are as follows: P1.t is 40 kb; P1.g is 70 kb; P1.u is 70 kb; PACz is 200
kb; and BAC.1 is 80 kb. Approximately 75% (140 kb) of the 190 kb genomic DNA region was sequenced by
the Ordered Shotgun Strategy (OSS) (Chen et al., Genomics 17: 651-56 (1993)), and assembled into contigs using
AutoAssembler™ (Applied Biosystems, Perkin Elmer, Foster City, CA, cat. no. 903227). The preliminary
order of these contigs was determined by manual analysis. There were 47 contigs in the 140 kb region. A PCR-
based approach to ordering the contigs and filling in the gaps was employed. The following summarizes the
number and sizes of the gaps. The 50 kb of sequence unique to BAC.1 was sequenced by a total shotgun
approach with a ten-fold redundancy.

Size of gap        Number
<50 bp             13
50-150 bp          7
150-300 bp         7
300-1000 bp        10
1000-5000 bp       7
>5000 bp           2 (15,000 bp)
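As a quick consistency check on the table above, the gap counts can be summed and compared with the number of contigs. The sketch below is illustrative only; it assumes that the 47 contigs lie in a single linear order, so that 46 gaps separate them, which the text does not state explicitly.

```python
# Gap size classes and counts, copied from the table above.
gap_counts = {
    "<50 bp": 13,
    "50-150 bp": 7,
    "150-300 bp": 7,
    "300-1000 bp": 10,
    "1000-5000 bp": 7,
    ">5000 bp": 2,
}

n_contigs = 47  # number of contigs reported for the 140 kb region

total_gaps = sum(gap_counts.values())
# Assumption: the contigs form one linear order, so n contigs leave n - 1 gaps.
expected_gaps = n_contigs - 1

print(total_gaps, expected_gaps)
```

Under that assumption the tabulated gaps account for every junction between adjacent contigs.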

DNA sequencing:

ABI DYE-primer™ chemistry (PE Applied Biosystems, Foster City, CA; Cat. No.: 402112) was used
to end-sequence the lambda and plasmid subclones. ABI DYE-terminator™ chemistry (PE Applied Biosystems,
Foster City, CA, Cat. No: 403044) was used to sequence the PCR products with their respective PCR primers.
The sequences were collected with an ABI377 instrument. For PCR products larger than 1 kb, walking primers
were used. The sequences of contigs generated by the OSS strategy in AutoAssembler™ (PE Applied
Biosystems, Foster City, CA; Cat. No: 903227) and the gap-filling sequencing trace files were imported into
Sequencher™ (Gene Codes Corp., Ann Arbor, MI) for overlapping and editing. The sequences generated by
the total shotgun strategy were assembled using Phred and Phrap and edited using Consed
(http://chimera.biotech.washington.edu/uwgc/projects.htm) and GFP (Genome Reconstruction Manager for
Phrap), version 1.2 (http://stork.cellb.bcm.tmc.edu/g^/).

PCR-Based Gap Filling Strategy:

Primers were designed based on the 5'- and 3'-end sequences of each contig, avoiding repetitive and
low quality sequence regions. All primers were designed to be 19-24-mers with 50-70% G/C content. Oligos
were synthesized and gel-purified by standard methods.
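The two numeric design rules stated above (length of 19-24 nucleotides and 50-70% G/C content) can be expressed as a simple filter. The following is a minimal sketch, not the procedure actually used; the function names and the example sequences are hypothetical, and the avoidance of repetitive or low quality regions is not modeled.

```python
def gc_fraction(primer: str) -> float:
    """Return the fraction of G and C bases in a primer sequence."""
    s = primer.upper()
    return (s.count("G") + s.count("C")) / len(s)


def passes_design_rules(primer: str,
                        min_len: int = 19, max_len: int = 24,
                        min_gc: float = 0.50, max_gc: float = 0.70) -> bool:
    """Check the length and G/C content rules given in the text."""
    if not (min_len <= len(primer) <= max_len):
        return False
    return min_gc <= gc_fraction(primer) <= max_gc


# Hypothetical example sequences, for illustration only.
print(passes_design_rules("ATGCGTACCGGATCCAGTCA"))  # 20-mer, 11 of 20 bases are G/C
print(passes_design_rules("ATATATATATATATATATAT"))  # 20-mer with no G/C
```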
Since the orientation and order of the contigs were unknown, permutations of the primers were used
in the amplification reactions. Two PCR kits were used: first, the XL PCR kit (Perkin Elmer, Norwalk, CT; Cat.
No.: N8080205), with extension times of approximately 10 minutes; and second, the Taq polymerase PCR kit
(Qiagen Inc., Valencia, CA; Cat. No.: 201223), which was used under high stringency conditions if smeared or multiple
products were observed with the XL PCR kit. The main PCR product from each successful reaction was
extracted from a 0.9% low melting agarose gel and purified with the Geneclean DNA Purification kit prior to
sequencing.
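The text states only that permutations of the primers were used. One way to picture this, sketched below under the assumption that each contig contributes one outward-facing primer per end and that every primer is paired with every primer from a different contig, is to enumerate the candidate pairs. The contig labels and the function name are hypothetical.

```python
from itertools import combinations


def candidate_primer_pairs(contigs):
    """Enumerate primer pairs drawn from the ends of two different contigs.

    Each contig is assumed to carry one primer at its 5' end and one at its
    3' end; pairs within the same contig are skipped because they cannot
    span a gap between contigs.
    """
    primers = [(contig, end) for contig in contigs for end in ("5'", "3'")]
    return [(a, b) for a, b in combinations(primers, 2) if a[0] != b[0]]


# Hypothetical contig labels, for illustration only.
pairs = candidate_primer_pairs(["contig_A", "contig_B", "contig_C"])
for pair in pairs:
    print(pair)
```

For n contigs this exhaustive scheme yields 2n(n-1) pairs, so in practice the preliminary manual ordering mentioned above would presumably be used to limit the reactions actually run.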
Analysis:

The identification and characterization of coding regions was carried out as follows: First, repetitive
sequences were masked using RepeatMasker (A.F.A. Smit & P. Green,
http://ftp.genome.washington.edu/RM/RM_details.html) which screens DNA sequences in FastA format against
a library of repetitive elements and returns a masked query sequence. Repeats not masked were identified by
comparing the sequence to the GenBank database using WUBLAST2.0 [Altschul, S & Gish, W., Methods
Enzymol. 266: 460-480 (1996); http://blast.wustl.edu/blast/README.html] and were masked manually.

Next, known genes were revealed by comparing the genomic regions against Genentech's protein
database using the WUBLAST2.0 algorithm and then annotated by aligning the genomic and cDNA sequences
for each gene, respectively, using a Needleman-Wunsch (Needleman and Wunsch, J. Mol. Biol. 48: 443-453
(1970)) algorithm to find regions of local identity between sequences. The strategy results in detection of all
exons of the five known genes in the region, THPO, TRAP2, eIF4g, CLCN2 and hRPB17 (see below).

Known genes                                         Map position
eukaryotic translation initiation factor 4 gamma    3q27-qter
thrombopoietin                                      3q26-q27
chloride channel 2                                  3q26-qter
TNF receptor associated protein 2                   not previously mapped
RNA polymerase II subunit hRPB17                    not previously mapped
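The genomic-to-cDNA alignment step described above cites the Needleman-Wunsch algorithm. The sketch below shows the basic dynamic programming recurrence for a global alignment score. The scoring values (match +1, mismatch -1, linear gap -1) are assumptions chosen for illustration and are not taken from the text; a practical genomic/cDNA alignment would also need to treat introns as long gaps, which this sketch does not attempt.

```python
def needleman_wunsch_score(a: str, b: str,
                           match: int = 1, mismatch: int = -1,
                           gap: int = -1) -> int:
    """Return the optimal global alignment score of sequences a and b."""
    # prev[j] holds the best score for aligning the processed prefix of a
    # with the first j characters of b.
    prev = [j * gap for j in range(len(b) + 1)]
    for i in range(1, len(a) + 1):
        curr = [i * gap]  # aligning a[:i] against an empty prefix of b
        for j in range(1, len(b) + 1):
            diag = prev[j - 1] + (match if a[i - 1] == b[j - 1] else mismatch)
            up = prev[j] + gap        # gap in b
            left = curr[j - 1] + gap  # gap in a
            curr.append(max(diag, up, left))
        prev = curr
    return prev[-1]


print(needleman_wunsch_score("ACGT", "AGT"))
```

Only the score is computed here, using two rows of the matrix; recovering the alignment itself would require keeping the full matrix and tracing back from the final cell.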



Finally, novel transcription units were predicted using a number of approaches. CpG islands (S. Cross
& Bird, A., Curr. Opin. Genet. Dev. 5: 309-314 (1995)) were used to define promoter regions and were
identified as clusters of sites cleaved by enzymes recognizing GC-rich, 6 or 8-mer palindromic sequences (NotI,
NarI, BssHII, XhoI). CpG islands are usually associated with promoter regions of genes. WUBLAST2.0
analysis of short genomic regions (10-20 kb) versus GenBank revealed matches to ESTs. The individual EST
sequences (or where possible, their sequence chromatogram files) were retrieved and assembled with Sequencher
to provide a theoretical cDNA sequence (DNA36443). GRAIL2 (ApoCom Inc., Knoxville, TN, command line
version for the DEC alpha) was used to predict a novel exon. The five known genes in the region served as
internal controls for the success of the GRAIL algorithm.
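The identification of CpG islands as clusters of restriction sites, as described above, can be sketched as a scan for the recognition sequences of the four named enzymes followed by a simple windowed count. The recognition sequences used are the standard ones for NotI, NarI, BssHII and XhoI; the window size and the minimum number of sites per cluster are assumptions introduced here, since the text gives no thresholds.

```python
import re

# Standard recognition sequences of the enzymes named in the text.
SITES = {
    "NotI": "GCGGCCGC",
    "NarI": "GGCGCC",
    "BssHII": "GCGCGC",
    "XhoI": "CTCGAG",
}


def find_sites(seq: str):
    """Return sorted (position, enzyme) tuples for every site in seq."""
    seq = seq.upper()
    hits = []
    for name, motif in SITES.items():
        # A lookahead pattern lets overlapping occurrences be reported.
        for m in re.finditer(f"(?={motif})", seq):
            hits.append((m.start(), name))
    return sorted(hits)


def site_clusters(seq: str, window: int = 1000, min_sites: int = 3):
    """Return groups of sites that fall within `window` bp of a starting site.

    The window and min_sites values are illustrative assumptions.
    """
    hits = find_sites(seq)
    clusters = []
    for i, (start, _) in enumerate(hits):
        members = [h for h in hits[i:] if h[0] < start + window]
        if len(members) >= min_sites:
            clusters.append(members)
    return clusters


# Hypothetical test sequence containing one site each for three enzymes.
demo = "A" * 10 + "GCGGCCGC" + "A" * 10 + "GGCGCC" + "A" * 10 + "GCGCGC" + "A" * 10
print(site_clusters(demo))
```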
Isolation:

A partial endothelin converting enzyme-2 (ECE-2) cDNA clone was isolated by first splicing in silico
the ECE-2 exons predicted in the genomic sequence to generate a putative sequence (DNA36443). An
oligonucleotide probe (GAAGCAGTGCAGCCAGCAGTAGAGAGGCACCTGCTAAGA) (SEQ ID NO:530)
was designed and used to screen a human fetal small intestine library (LIB110) and internal PCR primers
(36443f1) (ECE2.f: ACGCAGCTGGAGCTGGTCTTAGCA) (SEQ ID NO:531) and (36443r1) (ECE2.r)
(GGTACTGGACCCCTAGGGCCACAA) (SEQ ID NO:532) were used to confirm clones hybridizing to the
probe prior to sequencing. One positive clone was obtained, however this cDNA (DNA49830) represented a
partially spliced transcript containing appropriately spliced ...
The oligo dT primer annealed to a polyA-stretch within an Alu element present in intron 6. An additional ECE-2
cDNA fragment (DNA49831) was obtained by PCR from a human fetal kidney library (LIB227) with primers
designed from the presumed cDNA sequence [36443f3: CCTCCCAGCCGAGACCAGTGG (SEQ ID NO:533)
and 36443r2: GGTCCTATAAGGGCCAAGACC (SEQ ID NO:534)]. This PCR product extended from exon
13 into the 3' untranslated region in exon 18.
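The in silico splicing step mentioned above amounts to joining predicted exon intervals from the genomic sequence into one putative cDNA. The sketch below illustrates the idea; the coordinate convention (0-based, half-open, forward strand), the function name and the toy sequence are all assumptions made for illustration and do not reproduce the actual ECE-2 exon predictions.

```python
def splice_in_silico(genomic: str, exons):
    """Join exon intervals of a genomic sequence into a putative cDNA.

    `exons` is a list of (start, end) pairs, 0-based and half-open, on the
    forward strand. Overlapping intervals are rejected.
    """
    ordered = sorted(exons)
    for (_, end1), (start2, _) in zip(ordered, ordered[1:]):
        if start2 < end1:
            raise ValueError("exon intervals overlap")
    return "".join(genomic[start:end] for start, end in ordered)


# Toy example: two exons (upper case) separated by one intron (lower case).
toy_genomic = "AAACCC" + "gtaagtttttag" + "GGGTTT"
print(splice_in_silico(toy_genomic, [(0, 6), (18, 24)]))
```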

A full length endothelin converting enzyme 2 (ECE-2) cDNA clone (DNA55800-1263) was isolated
from an oligo-dT-primed human fetal brain library. RNA from human fetal brain tissue (^0 weeks gestation,
#283005) (SRC175) was isolated by guanidine thiocyanate and 5 µg used to generate double stranded cDNA
which was cloned into the vector pRK5E. The 3'-primer
(pGACTAGTTCTAGATCGCGAGCGGCCGCCCTTTTTTTTTTTTTTT) (SEQ ID NO:535) and the 5'-linker
(pCGGACGCGTGGGTCGA) (SEQ ID NO:536) were designed to introduce XhoI and NotI restriction sites.
The library was screened with PCR primers [36443pcrf1: CGGCCGTGATGGCTGGTGACG (SEQ ID NO:537)
and 36443r3: GGCAGACTCCTTCCTATGGG (SEQ ID NO:538)] designed from the partial human ECE-2
cDNA sequences (DNA49830 and DNA49831). PCR products were cloned into the vector pCR2.1-TOPO
(Invitrogen Corp., Carlsbad, CA, Cat. No. K4500-01) and sequenced with DYE-terminator chemistry as
described above.

EXAMPLE 98: Northern Blot and in situ RNA Hybridization Analysis for PRO403

Expression of PRO403 mRNA in human tissues was examined by Northern blot analysis. Human
polyA+ RNA blots derived from human fetal and adult tissues (Clontech, Palo Alto, CA; Cat. Nos. 7760-1,
7756-1 and 7755-1) were hybridized to [32P-α]dATP-labelled cDNA fragments from a probe based on the full
length PRO403 cDNA. Blots were incubated with the probes in hybridization buffer (5X SSPE; 2X Denhardt's
solution; 100 mg/mL denatured sheared salmon sperm DNA; 50% formamide; 2% SDS) for 18 hours at 42°C,
washed to high stringency (0.1X SSC, 0.1% SDS, 50°C) and autoradiographed. The blots were developed after
overnight exposure by phosphorimager analysis (Fuji).

PRO403 mRNA transcripts were detected. Analysis of the expression pattern showed the strongest
signal of the expected 3.3 kb transcript in adult brain (highest in the cerebellum, putamen, medulla, and temporal
lobe, and lower in the cerebral cortex, occipital lobe and frontal lobe), spinal cord, lung and pancreas and high
levels of a 4.5 kb transcript in fetal brain and kidney.

EXAMPLE 99: Use of PRO Polypeptide-Encoding Nucleic Acid as Hybridization Probes

The following method describes use of a nucleotide sequence encoding a PRO polypeptide as a
hybridization probe.

DNA comprising the coding sequence of a PRO polypeptide of interest as disclosed herein may be
employed as a probe or used as a basis from which to prepare probes to screen for homologous DNAs (such as
those encoding naturally-occurring variants of the PRO polypeptide) in human tissue cDNA libraries or human
tissue genomic libraries.

Hybridization and washing of filters containing either library DNAs is performed under the following
high stringency conditions. Hybridization of radiolabeled PRO polypeptide-encoding nucleic acid-derived probe
to the filters is performed in a solution of 50% formamide, 5x SSC, 0.1% SDS, 0.1% sodium pyrophosphate,
50 mM sodium phosphate, pH 6.8, 2x Denhardt's solution, and 10% dextran sulfate at 42°C for 20 hours.
Washing of the filters is performed in an aqueous solution of 0.1x SSC and 0.1% SDS at 42°C.

DNAs having a desired sequence identity with the DNA encoding full-length native sequence PRO
polypeptide can then be identified using standard techniques known in the art.

EXAMPLE 100: Expression of PRO Polypeptides in E. coli

This example illustrates preparation of an unglycosylated form of a desired PRO polypeptide by
recombinant expression in E. coli.

The DNA sequence encoding the desired PRO polypeptide is initially amplified using selected PCR
primers. The primers should contain restriction enzyme sites which correspond to the restriction enzyme sites
on the selected expression vector. A variety of expression vectors may be employed. An example of a suitable
vector is pBR322 (derived from E. coli; see Bolivar et al., Gene, 2:95 (1977)) which contains genes for
ampicillin and tetracycline resistance. The vector is digested with restriction enzyme and dephosphorylated.
The PCR amplified sequences are then ligated into the vector. The vector will preferably include sequences
which encode for an antibiotic resistance gene, a trp promoter, a polyhis leader (including the first six STII
codons, polyhis sequence, and enterokinase cleavage site), the specific PRO polypeptide coding region, lambda
transcriptional terminator, and an argU gene.

The ligation mixture is then used to transform a selected E. coli strain using the methods described in
Sambrook et al., supra. Transformants are identified by their ability to grow on LB plates and antibiotic resistant
colonies are then selected. Plasmid DNA can be isolated and confirmed by restriction analysis and DNA
sequencing.






Selected clones can be grown overnight in liquid culture medium such as LB broth supplemented with
antibiotics. The overnight culture may subsequently be used to inoculate a larger scale culture. The cells are
then grown to a desired optical density, during which the expression promoter is turned on.

After culturing the cells for several more hours, the cells can be harvested by centrifugation. The cell
pellet obtained by the centrifugation can be solubilized using various agents known in the art, and the solubilized
PRO polypeptide can then be purified using a metal chelating column under conditions that allow tight binding
of the protein.

PRO181, PRO195, PRO200, PRO237, PRO273, PRO540, PRO322, PRO1017, PRO938, PRO162,
PRO1114, PRO827 and PRO1008 were expressed in E. coli in a poly-His tagged form, using the following
procedure. The DNA encoding the PRO polypeptide was initially amplified using selected PCR primers. The
primers contained restriction enzyme sites which correspond to the restriction enzyme sites on the selected
expression vector, and other useful sequences providing for efficient and reliable translation initiation, rapid
purification on a metal chelation column, and proteolytic removal with enterokinase. The PCR-amplified,
poly-His tagged sequences were then ligated into an expression vector, which was used to transform an E. coli host
based on strain 52 (W3110 fuhA(tonA) lon galE rpoHts(htpRts) clpP(lacIq)). Transformants were first grown
in LB containing 50 mg/ml carbenicillin at 30°C with shaking until an O.D.600 of 3-5 was reached. Cultures
were then diluted 50-100 fold into CRAP media (prepared by mixing 3.57 g (NH4)2SO4, 0.71 g sodium
citrate·2H2O, 1.07 g KCl, 5.36 g Difco yeast extract, 5.36 g Sheffield hycase SF in 500 mL water, as well as
110 mM MPOS, pH 7.3, 0.55% (w/v) glucose and 7 mM MgSO4) and grown for approximately 20-30 hours
at 30°C with shaking. Samples were removed to verify expression by SDS-PAGE analysis, and the bulk culture
was centrifuged to pellet the cells. Cell pellets were frozen until purification and refolding.

E. coli paste from 0.5 to 1 L fermentations (6-10 g pellets) was resuspended in 10 volumes (w/v) in 7
M guanidine, 20 mM Tris, pH 8 buffer. Solid sodium sulfite and sodium tetrathionate were added to make final
concentrations of 0.1 M and 0.02 M, respectively, and the solution was stirred overnight at 4°C. This step
results in a denatured protein with all cysteine residues blocked by sulfitolization. The solution was centrifuged
at 40,000 rpm in a Beckman Ultracentrifuge for 30 min. The supernatant was diluted with 3-5 volumes of metal
chelate column buffer (6 M guanidine, 20 mM Tris, pH 7.4) and filtered through 0.22 micron filters to clarify.
The clarified extract was loaded onto a 5 ml Qiagen Ni-NTA metal chelate column equilibrated in the
metal chelate column buffer. The column was washed with additional buffer containing 50 mM imidazole
(Calbiochem, Utrol grade), pH 7.4. The protein was eluted with buffer containing 250 mM imidazole.
Fractions containing the desired protein were pooled and stored at 4°C. Protein concentration was estimated
by its absorbance at 280 nm using the calculated extinction coefficient based on its amino acid sequence.
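The text does not say how the extinction coefficient was calculated from the amino acid sequence. One commonly used approximation sums fixed contributions at 280 nm of 5500 M^-1 cm^-1 per tryptophan, 1490 per tyrosine and 125 per cystine (disulfide); the sketch below applies it under that assumption, and the function names and example sequence are hypothetical. For the sulfitolyzed protein described above, in which the cysteines are blocked, the number of disulfides would be taken as zero.

```python
def extinction_coefficient_280(seq: str, n_disulfides: int = 0) -> int:
    """Approximate molar extinction coefficient at 280 nm (M^-1 cm^-1).

    Assumed contributions: 5500 per Trp, 1490 per Tyr, 125 per disulfide.
    """
    s = seq.upper()
    return 5500 * s.count("W") + 1490 * s.count("Y") + 125 * n_disulfides


def molar_concentration(a280: float, seq: str,
                        path_cm: float = 1.0, n_disulfides: int = 0) -> float:
    """Estimate molar concentration from A280 using the Beer-Lambert law."""
    eps = extinction_coefficient_280(seq, n_disulfides)
    if eps == 0:
        raise ValueError("sequence has no Trp, Tyr or disulfides; A280 is not informative")
    return a280 / (eps * path_cm)


# Hypothetical peptide, for illustration only.
print(extinction_coefficient_280("MWWYAC"))
print(molar_concentration(1.249, "MWWYAC"))
```

Multiplying the molar concentration by the molecular weight of the polypeptide converts the result to a mass concentration.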

The proteins were refolded by diluting sample slowly into freshly prepared refolding buffer consisting
of: 20 mM Tris, pH 8.6, 0.3 M NaCl, 2.5 M urea, 5 mM cysteine, 20 mM glycine and 1 mM EDTA.
Refolding volumes were chosen so that the final protein concentration was between 50 and 100 micrograms/ml.
The refolding solution was stirred gently at 4°C for 12-36 hours. The refolding reaction was quenched by the
addition of TFA to a final concentration of 0.4% (pH of approximately 3). Before further purification of the
protein, the solution was filtered through a 0.22 micron filter and acetonitrile was added to 2-10% final
concentration. The refolded protein was chromatographed on a Poros R1/H reversed phase column using a